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IN TH 

REQ! 

ant to 37 CFR 1.60, please file a Atty Dkt: 1487-17 

j f3fotinuation/l53 divisional C# M# 

^gfte-pending prior PATENT APPLICATION of: 

=S|emor: MAERTENS, et al. Date: September 12, 1997 

^prialNo. 08/612,973 Group: 1815 

' Filed* March 11, 1996 Examiner: Woodward, M. 

For: PURIFIED HEPATITIS C VIRUS ENVELOPE PROTEINS FOR DIAGNOSTIC AND THERAPEUTIC USE 

Hon. Commissioner of Patents and Trademarks 

Washington, DC 20231 

Sir: 

This request for filing under Rule 60 is made by the following named inventor(s) (using the above-identified title): 
Inventor(s): MAERTENS, et al. 

g| Attached is a true copy of the prior application as originally filed including the specification, claims, Oath/Declaration 
and drawings (if any) and abstract (if any). No amendments (if any) referenced in the Oath or Declaration filed to 
complete the prior application introduced new matter. 

g] Priority is hereby claimed under 35 USC 1 1 9 based on the following foreign applications: 

Application Number Country Day/Month/Year Filed 

PCT/EP95/03031 Pet 31/07/1995 

94870132.1 EP 29/07/1994 

□ certified copy(ies) of foreign application(s) attached or 

□ already filed on in prior appln no. filed 



already filed in 08/612,973 filed March 11, 1996 



_£-j please amend the specification by inserting before the first line: This application claims the benefit of U.S. 
2 Provisional Application No. , filed 

The prior application is assigned to Innogenetics NV, Ghent, Belgium. 
H Power of Attorney has been granted to Thomas E. Byrne, et al, Reg. No. 32,205 of Nixon & Vanderhye P.C., 1 1 00 
^ North Glebe Road, 8 th Floor, Arlington, Virginia 22201 . 

0% Address all future communications to: Nixon & Vanderhye P.O. 1 100 North Glebe Road, 8 th Floor, Arlington, Virginia 
H 22201. 

U!|2 Please amend the specification by inserting before the first line -This is a divisional of application Serial No. 
\j 08/612,973, filed March 11, 1996.™ 

* IS "Small entity" statement of record. g| "Small entity" statement attached. 

Petition filed in prior application to extend its life to insure copendency. 
■S [x] The Examiner's attention is directed to the prior art cited in the parent application by applicant and/or Examiner for the 
ft reasons stated therein. The Examiner is requested to acknowledge consideration of same by returning an initialed 
K copy of the attached PTO 1449 pursuant to MPEP §609. 

1 213 Please enter the attached and/or below preliminary amendment prior to calculation of filing fee: 

FILING FEE IS BASED ON CLAIMS AS FILED LESS ANY HEREWITH CANCELED 

Basic Filing Fee 

Total effective claims 10 - 20 (at least 20) = 0 x$ 22.00 
Independent claims 4 -3 (at least 3) = 1 x$ 80.00 

If any proper multiple dependent claims now added for first time, add $260.00 (ignore improper) 

SUBTOTAL 

If "small entity," then enter half (1/2) of subtotal and subtract 

SECOND SUBTOTAL 

Assignment Recording Fee ($40.00) 

TOTAL FEE ENCLOSED 



$ 


770.00 


$ 


0.00 


$ 


80.00 


$ 


0.0Q 


$ 


850.00 


-$( 


425.00) 


$ 


425.0G 


$ 


0.00 


$ 


425.00 



The Commissioner is hereby authorized to charge any deficiency in the fee(s) filed, or asserted to be filed, or which should 
have been filed herewith (or with any paper hereafter filed in this application by this firm) to our Account No. 14-1140. A 
duplicate copy of this sheet is attached. 

1100 North Glebe Road 
8 th Floor 

Arlington, Virginia 22201-4714 
Telephone: (703)816-4091 
Facsimile: (703)816-4100 
BJS:msg 



NIXON & VANDERHYE P.C. 

By Atty: B.J. Sadoff, Reg. No. 36,663 



Signature 




IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re PATENT APPLICATION of AttyRef.: 1487-17 

MAERTENS, et al. Group: Unassigned 

Application No.: NOT YET ASSIGNED Examiner: Unassigned 

(DIVISIONAL OF APPLICATION NO. 08/612,973) 

Filed: Herewith 

For: PURIFIED HEPATITIS C VIRUS ENVELOPE 

PROTEINS FOR DIAGNOSTIC AND THERAPEUTIC USE 

September 12, 1997 

PRELIMINARY AMENDMENT 

Honorable Commissioner of 
Patents and Trademarks 
Washington, D.C. 20231 

Sir: 

Preliminarily amend the above-identified application as follows. 

IN THE SPECIFICATION : 

Amend the specification as follows. 

Insert the attached SEQUENCE LISTING before the claims pages and renumber 
subsequent pages accordingly. 

Please insert the attached "ABSTRACT" after the claims pages. 
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MAERTENS, et aL - Not Yet Assigned 
(Divisional of Appln. No. 08/612,973) 
Preliminary Amendment 



IN THE CLAIMS : 

Amend the claims as follows. 
Cancel claims 1-48, without prejudice. 
Add the following new claims. 

-49. A vaccine composition obtained by immunizing a mammal with an effective 
amount of: 

a composition comprising purified recombinant HCV single or specific oligomeric 
recombinant envelope proteins selected from the group consisting of El and/or E2 and/or E1/E2; 
and optionally a pharmaceutical^ acceptable adjuvant. 

50. A composition according to claim 49 wherein said recombinant HCV envelope 
proteins are produced by recombinant mammalian cells. 

SLA composition according to claim 49 wherein said recombinant HCV envelope 
proteins are produced by recombinant yeast cells. 
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MAERTENS, et al. - Not Yet Assigned 
(Divisional of Appln. No. 08/612,973) 
Preliminary Amendment 

52. A vaccine composition obtained by immunizing a mammal with an effective amount 
of a composition comprising purified recombinant HCV single or specific oligomeric 
recombinant envelope proteins selected from the group consisting of El and/or E2 and/or E1/E2, 
and optionally a pharmaceutically acceptable adjuvant; 

said proteins being the expression product of at least one recombinant vector selected 
from the group consisting of: 

a) a recombinant vector comprising a vector sequence, a prokaryotic, eukaryotic or viral 
promoter sequence followed by a nucleotide sequence allowing the expression of said single or 
specific oligomeric El and/or E2 and/or E1/E2 protein; 

b) a recombinant vector according to (a), with said nucleotide sequence being 
characterized further in that it encodes a single HCV El protein starting in the region between 
amino acid positions 1 and 192 and ending in the region between amino acid positions 250 and 
400; 

c) a recombinant vector according to (b), with said nucleotide sequence being 
characterized further in that it encodes a single HCV El protein starting in the region between 
amino acid positions 117 and 192 and ending in the region between amino acid positions 263 and 
400; 

d) a recombinant vector according to (b) or (c), with said nucleotide sequence being 
characterized further in that in encodes a single HCV El protein bearing a deletion of the first 
hydrophobic domain between positions 264 to 293, plus or minus 8 amino acids; 
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MAERTENS, et al. - Not Yet Assigned 
(Divisional of Appln. No. 08/612,973) 
Preliminary Amendment 

e) a recombinant vector according to (a), with said nucleotide sequence being 
characterized further in that in encodes a single HCV E2 protein starting in the region between 
amino acid positions 290 and 406 and ending in the region between amino acid positions 600 and 
820; 

f) a recombinant vector according to (e), with said nucleotide sequence being 
characterized further in that it ends at any of amino acid positions 623, 650, 661, 673, 710, 715, 
720, 746 or 809; 

g) a recombinant vector according to any one of (b)-(f), said nucleotide sequence further 
comprising a 5 '-terminal ATG codon and a 3 '-terminal stop codon; and 

h) a recombinant vector according to any one of (b)-(g) further comprising a factor Xa 
cleavage site and/or 3 to 10 histidine codons positioned 3 '-terminally to said nucleotide 
sequence. 

53. A vaccine composition obtained by immunizing a mammal with an effective amount 
of a composition comprising at least one of the following El and/or E2 peptides: 

El-31 (SEQ ID NO:56) spanning amino acids 181 to 200 of the Core/El VI region, 

El-33 (SEQ ID NO:57) spanning amino acids 193 to 212 of the El region, 

El-35 (SEQ ID NO:58) spanning amino acids 205 to 224 of the El V2 region (epitope 

B), 

E1-35A (SEQ ID NO:59) spanning amino acids 208 to 227 of the El V2 region (epitope 

B), 
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MAERTENS, et al. - Not Yet Assigned 
(Divisional of Appln. No. 08/612,973) 
Preliminary Amendment 

IbEl (SEQ ID NO:53) spanning amino acids 192 to 228 of El regions VI, CI, and V2 
regions (containing epitope B), 

El-51 (SEQ ID NO:66) spanning amino acids 301 to 320 of the El region, 

El-53 (SEQ ID NO:67) spanning amino acids 313 to 332 of the El C4 region (epitope 

A), 

El -55 (SEQ ID NO:68) spanning amino acids 325 to 344 of the El region, 
Env 67 or E2-67 (SEQ ID NO:72) spanning amino acid positions 397 to 418 of the E2 
region (epitope A), 

Env 69 or E2-69 (SEQ ID NO:73) spanning amino acid positions 409 to 428 of the E2 
region (epitope A), 

Env 23 or E2-23 (SEQ ID NO:86) spanning positions 583 to 602 of the E2 region 
(epitope E), 

Env 25 or E2-25 (SEQ ID NO:87) spanning positions 595 to 614 of the E2 region 
(epitope E), 

Env 27 or E2-27 (SEQ ID NO:88) spanning positions 607 to 626 of the E2 region 
(epitope E), 

Env 178 or E2-178 (SEQ ID NO:83) spanning positions 547 to 586 of the E2 region 
(epitope D), and 

Env 13B or E2-13B (SEQ ID NO:82) spanning positions 523 to 542 of the E2 region 
(epitope C). 
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MAERTENS, et al. - Not Yet Assigned 
(Divisional of Appln. No. 08/612,973) 
Preliminary Amendment 

54. A vaccine composition obtained by immunizing a mammal with an effective amount 
of a composition comprising at least one E2 conformational epitope selected from the group 
consisting of 

epitope F recognized by monoclonal antibodies 15C8C1, 12D11F1, and 8G10D1H9, 
epitope G recognized by monoclonal antibody 9G3E6, 

epitope H (or C) recognized by monoclonal antibodies 10D3C4 and 4H6B2, and 
epitope I recognized by monoclonal antibody 17F2C2. 

55. A method of immunizing a mammal against HCV comprising administering an 
effective amount of a composition according to any one of claims 49-51 and, optionally, a 
pharmaceutically acceptable adjuvant. 

56. The method of claim 53 wherein said mammal is a human. -- 

REMARKS 

Claims 1-48 have been canceled, without prejudice. 

Claims 49-56 have been added. The present divisional application has been filed to 
pursue the allegedly distinct invention of Group X (claim 37) of the restriction requirement of 
May 28, 1997 in the parent Application No. 08/612,973. 



6 



234126 





MAERTENS, et al. - Not Yet Assigned 
(Divisional of Appln. No. 08/612,973) 
Preliminary Amendment 

The attached paper copy of the SEQUENCE LISTING is the same as the paper and 
computer readable form of the SEQUENCE LISTING filed in the parent Application No. 
08/612,973. No new matter has been added. Pursuant to Rule 822(e) no further computer 
readable copy of the SEQUENCE LISTING is believed required. A separate "Letter" is attached 
as required by the same Rule. The Office is requested to contact the undersigned if anything 
further is required at this time. 

An early and favorable action on the merits is requested. 



Respectfully submitted, 



NIXON & VANDERHYE, P.C. 




Reg. No. 36,663 

Tel. No. (703) 816-4091 

Fax No. (703) 816-4100 



1 100 North Glebe Road; 8 th Floor 
Arlington, Virginia 22201-4714 
Tel. No. (703) 816-4000 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re PATENT APPLICATION of AttyRef.: 1487-17 

MAERTENS, et al. Group: Unassigned 

Application No.: NOT YET ASSIGNED Examiner: Unassigned 

(DIVISIONAL OF APPLICATION NO. 08/612,973) 

Filed: Herewith 

For: PURIFIED HEPATITIS C VIRUS ENVELOPE 

PROTEINS FOR DIAGNOSTIC AND THERAPEUTIC USE 

September 12, 1997 

LETTER 

Honorable Commissioner of 
Patents and Trademarks 
Washington, D.C. 20231 

Sir: 

Pursuant to Rule 822(e), the applicants note the computer readable form of the Sequence 
Listing of this new application is identical with the computer readable form of another 
application of the applicant on file in the Office. That other application is Application No. 
08/612,973, filed March 11, 1996. This reference to the other application and computer readable 
form is being made in lieu of filing a duplicate computer readable form in this new application. 

Respectfully submitted, 
NIXON & VANDERHYE, P.C. 



By: 




Reg. No. 36,663 

Tel. No. (703) 816-4091 

Fax No. (703) 816-4100 

1 100 North Glebe Road; 8 th Floor 
Arlington, Virginia 22201-4714 




SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: MAERTENS, GEERT 
BOSMAN, FONS 
DE MARTYNOFF, GUY 
BUYSE, MARIE-ANGE 

(ii) TITLE OF INVENTION: PURIFIED HEPATITIS C VIRUS ENVELOPE 
PROTEINS FOR DIAGNOSTIC AND THERAPEUTIC USE 

(iii) NUMBER OF SEQUENCES: 111 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NIXON & VANDERHYE P.C. 

(B) STREET: 1100 NORTH GLEBE ROAD 

(C) CITY: ARLINGTON 

(D) STATE: VIRGINIA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 22201-4714 

{v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/612,973 

(B) FILING DATE: ll-MAR-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: BYRNE, THOMAS E. 

(B) REGISTRATION NUMBER: 32,205 

(C) REFERENCE/ DOCKET NUMBER: 1487-10 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703) 816-4000 

(B) TELEFAX: (703) 816-4100 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
{C} STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



HCVENV.-T 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GGCATGCAAG CTTAATTAAT T 21 
{2} INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

^CCGGGGAGGC CTGCACGTGA TCGAGGGCAG ACACCATCAC CACCATCACT AATAGTTAAT 60 

JTAACTGCA 68 

J{2) INFORMATION FOR SEQ ID NO: 3: 

j (i) SEQUENCE CHARACTERISTICS: 

% (A) LENGTH: 642 base pairs 

\ (B) TYPE: nucleic acid 

4 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

] (ii) MOLECULE TYPE: cDNA 

| (iii) HYPOTHETICAL: NO 

\ (iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..639 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..636 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA CTG TCC TGT 48 
Met Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 

CTG ACC ATT CCA GCT TCC GCT TAT GAG GTG CGC AAC GTG TCC GGG ATG 96 
Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Met 
20 25 30 

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG TAT GAG GCA 144 
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Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr Glu Ala 

35 40 45 

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 

50 55 60 

AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG CTC GCA GCT 240 

Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 

65 70 75 80 

AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC GTC GAT TTG 288 

Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 

85 90 95 

CTC GTT GGG GCG GCT GCT CTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336 

Leu Val Gly Ala Ala Ala Leu Cys Ser Ala Met Tyr Val Gly Asp Leu 

100 105 110 

TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC TCG CCT CGC 384 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He Ser Pro Arg 

115 120 125 



Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 



432 



ATA ACA GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT 480 
He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro 
145 150 155 160 

ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA CAA GCT GTC 528 
Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro Gin Ala Val 
165 170 175 

GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG GGC CTC GCC 57 6 

Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala 
180 185 190 

TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT GTG ATG CTA 624 
Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu He Val Met Leu 
195 200 205 

CTC TTT GCT CTC TAATAG 642 
Leu Phe Ala Leu 
210 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 



Leu Thr lie Pro Ma Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Met 
20 25 30 

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr Glu Ala 
35 40 45 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Leu Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr lie Ser Pro Arg 
115 120 125 

Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 

**'>ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro 
:5L45 150 155 160 

;0Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro Gin Ala Val 
-•j 165 170 175 

; :: Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala 
180 185 190 

3ryr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu He Val Met Leu 
y 195 200 205 

■;=;Leu Phe Ala Leu 
* 210 



{2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..792 

(ix) FEATURE: 



# 



. (A) NAME/KEY: matjpeptide 
(B) LOCATION: 1..789 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 14 4 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 24 0 

_ieu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
Q 65 70 75 80 

CJCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
n'Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 

fifl 85 90 95 

; 3rAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 

y; ?Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
^ 100 105 110 

OGTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 

yWal Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 

U H5 120 125 

^HCTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432 

^MLeu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
H 130 135 140 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480 
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 57 6 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 
Pro Gly His lie Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA 672 
Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 
210 215 220 
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CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 720 
Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
225 230 235 240 

GGT CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT 763 
Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu He 
245 250 255 

GTG ATG CTA CTC TTT GCT CCC TAATAG 795 
Val Met Leu Leu Phe Ala Pro 
260 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

3Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
D 15 10 15 

rival Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 

Ll 20 25 30 

^=Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
"4 35 40 * 45 

'■^hr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 

3 50 55 60 

;f£eu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
^ 65 70 75 80 

--Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 
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Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 
210 215 220 

Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
225 230 235 240 

Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu He 
245 250 255 

Val Met Leu Leu Phe Ala Pro 
260 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 
{C} STRANDEDNESS : single 
(D) TOPOLOGY: linear 

Q (ii) MOLECULE TYPE : cDNA 

\;3 (iii) HYPOTHETICAL: NO 

m (iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..630 

(ix) FEATURE: 

(A) NAME /KEY: mat__peptide 

(B) LOCATION: 1.-627 



4 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

ATG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGT GCT GCC AGA 96 
Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC ATT CCA GCT TCC GCT TAT GAG GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 
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TCC GGG ATG TAC CAT GTC ACG AAC GAG TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
Tyr Glu Ala Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACT ACG ACA ATA CGA CGC CAC 432 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480 
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

GGG GAT CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

iiSCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 57 6 

ufSer Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 

;y 180 185 190 

□TCC GGC CAC ATA ACA GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 

."J>ro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

s TGG TAATAG 633 
HTrp 

,n 210 



2) INFORMATION FOR SEQ ID NO: 8: 

"4 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 



Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 



Leu Ser Cys Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
85 90 95 

Tyr Glu Ala Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr lie 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

rSro Gly His lie Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
W 195 200 205 



fllTrp 



112) INFORMATION FOR SEQ ID NO: 9: 

s (i) SEQUENCE CHARACTERISTICS: 
Q (A) LENGTH: 483 base pairs 

yj (B) TYPE: nucleic acid 

i ; I (C) STRANDEDNESS : single 

m (D) TOPOLOGY: linear 

fi (ii) MOLECULE TYPE: cDNA 

. (iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..480 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..477 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCC CTG CTG TCC TGT 48 
Met Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 

CTG ACC ATA CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG TCC GGG GTG 96 



Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATA GTG TAT GAG GCA 144 
Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr Glu Ala 
35 40 45 

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192 
Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

GGC AAC TCC TCC CGT TGC TGG GTG GCG CTC ACT CCC ACG CTC GCG GCC 240 
Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 283 
Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

CTC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336 
Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 384 

3Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 

0 115 120 125 

jjCGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 432 

SArg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 

1 130 135 140 

-GTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TCC TAATAG 483 

MVal Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser 

145 150 155 160 

(2) INFORMATION FOR SEQ ID NO: 10: 

I (i) SEQUENCE CHARACTERISTICS: 

D (A) LENGTH: 159 amino acids 

4 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 

Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 
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Arg Asn Ala Ser Val Pro Thr Thr 
85 

Leu Val Gly Ala Ala Ala Phe Cys 
100 

Cys Gly Ser Val Phe Leu Val Ser 
115 120 

Arg His Gin Thr Val Gin Asp Cys 
130 135 

Val Ser Gly His Arg Met Ala Trp 
145 150 




Thr lie Arg Arg His Val Asp Leu 
90 95 

Ser Ala Met Tyr Val Gly Asp Leu 
105 110 

Gin Leu Phe Thr Phe Ser Pro Arg 
125 

Asn Cys Ser lie Tyr Pro Gly His 
140 



Asp Met Met Met Asn Trp Ser 
155 



(2} INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..477 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..474 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ATG TCC GGT TGC 


TCT 


TTC 


TCT 


ATC 


TTC 


CTC 


TTG 


GCC 


CTG CTG TCC TGT 


48 


Met Ser Gly Cys 


Ser 


Phe 


Ser 


He 


Phe 


Leu 


Leu 


Ala 


Leu Leu Ser Cys 




1 


5 










10 






15 




CTG ACC ATA CCA 


GCT 


TCC 


GCT 


TAT 


GAA 


GTG 


CGC 


AAC 


GTG TCC GGG GTG 


96 


Leu Thr lie Pro 


Ala 


Ser 


Ala 


Tyr 


Glu 


Val 


Arg 


Asn 


Val Ser Gly Val 




20 










25 








30 




TAC CAT GTC ACG 


AAC 


GAC 


TGC 


TCC 


AAC 


TCA 


AGC 


ATA 


GTG TAT GAG GCA 


144 


Tyr His Val Thr 


Asn 


Asp 


Cys 


Ser 


Asn 


Ser 


Ser 


He 


Val Tyr Glu Ala 




35 








40 










45 




GCG GAC ATG ATC 


ATG 


CAC 


ACC 


CCC 


GGG 


TGC 


GTG 


CCC 


TGC GTT CGG GAG 


192 


Ala Asp Met lie 


Met 


His 


Thr 


Pro 


Gly 


Cys 


Val 


Pro 


Cys Val Arg Glu 




50 






55 










60 






GGC AAC TCC TCC 


CGT 


TGC 


TGG 


GTG 


GCG 


CTC 


ACT 


CCC 


ACG CTC GCG GCC 


240 


Gly Asn Ser Ser Arg 


Cys 


Trp 


Val 


Ala 


Leu 


Thr 


Pro 


Thr Leu Ala Ala 




65 




70 










75 




80 
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AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288 
Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His Val Asp Leu 
85 90 95 

CTC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAC GTG GGG GAT CJC 336 
Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 384 
Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

CGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 432 
Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 

GTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 480 
Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 12: 

O (i) SEQUENCE CHARACTERISTICS: 
IS (A) LENGTH: 158 amino acids 

:fl (B) TYPE: amino acid 

fU (D) TOPOLOGY: linear 

tj (ii) MOLECULE TYPE: protein 

l f\ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

-=Met Ser Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
O 15 10 15 

fieu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 

;%r His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 

"H 35 40 45 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 

50 55 60 

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 

65 70 75 80 

Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser lie Tyr Pro Gly His 
130 135 140 

Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..633 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1..630 



yp (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

^ATG CTG GGT AAG GCC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 

-Tket Leu Gly Lys Ala lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
"% 1 5 10 15 

N3TG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 
sVal Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Aia Ala Arg 
3 20 25 30 

? JCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 
rkia Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
Z 35 40 45 

SfLCA GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTA ACC ATT CCA GCT TCC GCT TAC GAG GTG CGC AAC GTG 
Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 
Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 



CTC GCG GCT AGG AAC GCC AGC ATC CCC ACT ACA ACA ATA CGA CGC CAC 



Leu Ala Ala Arg Asn Ala Ser He Pro Thr Thr Thr He Arg Arg His 
130 135 140 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAG GTG 480 
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

GGG GAT CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576 
Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 
Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

TGG TAC TAATAG 640 
Trp Tyr 
210 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Leu Gly Lys Ala lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser He Pro Thr Thr Thr He Arg Arg His 
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135 



140 



Val Asp Leu Leu Val Gly 
145 150 



Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
155 160 



Gly Asp Leu Cys Gly Ser 
165 



Val Phe Leu Val Ser Gin Leu Phe Thr He 
170 175 



Ser Pro Arg Arg His Glu 
180 



Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
185 190 



Pro Gly His He Thr Gly 
195 



His Arg Met Ala Trp Asp Met Met Met Asn 
200 205 



Trp Tyr 
210 



(2) INFORMATION FOR SEQ ID NO: 15: 

{1} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL : NO 
(iii) ANTI-SENSE: NO 



□ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

y>TGCCCGGTT GCTCTTTCTC TATCTT 26 
•jf (2) INFORMATION FOR SEQ ID NO: 16: 

H (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGTTGGGTA AGGTCATCGA TACCCT 26 
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
{iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CTATTAGGAC CAGTTCATCA TCATATCCCA 30 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

H (ii) MOLECULE TYPE: cDNA 
!j (iii) HYPOTHETICAL: NO 
j (iii) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

gCTATTACCAG TTCATCATCA TATCCCA 27 

"(2) INFORMATION FOR SEQ ID NO: 19: 

3 (i) SEQUENCE CHARACTERISTICS: 

J . (A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATACGACGCC ACGTCGATTC CCAGCTGTTC ACCATC 36 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GATGGTGAAC AGCTGGGAAT CGACGTGGCG TCGTAT 36 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..720 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..717 



j (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC .GAC CTC 48 
Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 
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ILL 


rrr 








r*rr 
biL 


HL-Lj 


Alp 




*vrr 




AAC 


TCA 


AGC 


ATT 


GTG 


o o o 
ZOO 








i.yr 


£1X5 


v ax 


1 Xlx 


Asn 


Asp 


Cys 


Oct 


Asn 


Ser 


Ser 


He 


Val 






















y u 










95 






TAT 


GAG 


GCA 


GCG 


GAC 


ATG 


ATC 


ATG 


CAC 


ACC 


ccc 


GGG 


TGC 


GTG 


CCC 


TGC 


336 


Tyr 


biu 


rixa 


.fix a 


Asp 


wet 


Tl fl 
liS 




nlS 


I nr 


Pro 


Gly Cys Val Pro Cys 










XUU 










1 n ^ 

xUj 










110 








GTT 


CGG 


GAG 


AAC 


AAC 


TCT 


TCC 


CGC 


TGC 


TGG 


GTA 


GCG 


CTC 


ACC 


CCC 


ACG 


384 


Val 


Arg 


Glu 


Asn 


Asn 


Ser 


Ser 


Arg 


Cys 


Trp 


Val 


Ala 


Leu 


Thr 


Pro 


Thr 








1 1 c 
XX J 










1 9 Pi 










125 










CTC 


GCA 


GCT 


AGG 


AAC 


GCC 


AGC 


GTC 


ccc 


ACC 


ACG 


ACA 


ATA 


CGA 


CGC 


CAC 


432 


Leu 


Ala 


Ala 


Arg 


Asn 


Ala 


Ser 


Val 


Pro 


Thr 


Thr 


Thr He Arg Arg His 






xju 










i *3R 










140 












GTC 


GAT 


TCC 


CAG 


CTG 


TTC 


ACC 


ATC 


TCG 


CCT 


CGC 


CGG 


CAT 


GAG 


ACG 


GTG 


480 


Val 


Asp 


Ser 


Gin 


Leu 


Phe 


Thr 


He 


Ser 


Pro Arg 


Arg His 


Glu 


Thr 


Val 




X *i -J 










1 so 










155 










160 




CAG 


GAC 


TGC 


AAT 


TGC 


TCA 


ATC 


TAT 


CCC 


GGC 


CAC 


ATA 


ACG 


GGT 


CAC 


CGT 


528 


Gin 


Asp 


Cys 


Asn 


Cys 


Ser 


He 


Tyr 


Pro 


Gly His 


He 


Thr 


Gly His Arg 












1 DO 










170 










175 






ATG 


GCT 


TGG 


GAT 


ATG 


ATG 


ATG 


AAC 


TGG 


TCG 


CCT 


ACA ACG 


GCC 


CTG 


GTG 


576 


Met 


Ala 


Trp 


Asp 


Met 


Met 


Met 


Asn 


Trp 


Ser 


Pro 


Thr 


Thr 


Ala 


Leu 


Val 










ion 
lo U 










IOC 

103 










190 








GTA 


TCG 


CAG 


CTG 


CTC 


CGG 


ATC 


CCA 


CAA 


GCT 


GTC 


GTG 


GAC 


ATG 


GTG 


GCG 


624 


Val 


Ser 


Gin 


Leu 


Leu 


Arg 


He 


Pro 


Gin 


Ala 


Val 


Val 


Asp Met 


Val 


Ala 








1 










inn 










205 










rrr 


err 

uLt 




*vrr 


rm 


rTr 
Lj 1 u 


r*vr 


rrr 


ool 


CTC 


GCC 


TAC 


TAT 


TCC 


ATG 


GTG 




Gly 


Ala 


His 


Trp 


Gly 


Val 


Leu 


Ala 


Gly 


Leu 


Ala 


Tyr 


Tyr 


Ser 


Met: 


Val 






210 










215 










220 












GGG 


AAC 


TGG 


GCT 


AAG 


GTT 


TTG 


ATT 


GTG 


ATG 


CTA 


CTC 


TTT 


GCT 


CCC 


TAATAG 


72 


Gly 


Asn 


Trp 


Ala 


Lys 


Val 


Leu 


He 


Val 


Met 


Leu 


Leu 


Phe 


Ala 


Pro 






225 










230 










235 










240 





(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 



Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 



Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 

50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

--Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
% 180 185 190 

y^Val Ser Gin Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala 
n 195 200 205 

NGly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val 

|11 210 215 220 

"Gly Asn Trp Ala Lys Val Leu He Val Met Leu Leu Phe Ala Pro 
1,225 230 235 



(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
<iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..558 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..555 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 
Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

-TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 
Tyr Glu Ala Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 
Val Asp Ser Gin Leu Phe Thr lie Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 
Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His lie Thr Gly His Arg 
165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 
Met Ala Trp Asp Met Met Met Asn Trp 
180 185 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 



15 

Gly Gly Ala Ala Arg 
30 

Gly Val Asn Tyr Ala 
45 

Phe Leu Leu Ala Leu 
60 



Glu Val Arg Asn Val 
80 

Asn Ser Ser lie Val 
95 

Gly Cys Val Pro Cys 
110 

Ala Leu Thr Pro Thr 
125 

Thr He Arg Arg His 
140 

Arg His Glu Thr Val 
160 

He Thr Gly His Arg 
175 

ftet Ala Trp Asp Met Met Met Asn Trp 
180 185 



; ^2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..603 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..600 



10 



Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu 
20 25 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
35 40 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
50 55 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 
65 70 75 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser 
85 90 

Tyr Glu Ala Ala Asp Met lie Met His Thr Pro 
100 105 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
115 120 

fLeu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 
3 130 135 

^tal Asp Ser Gin Leu Phe Thr lie Ser Pro Arg 
Hi45 150 155 



-£ln Asp Cys Asn Cys Ser He Tyr Pro Gly His 

■H 165 170 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

■SAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
: ? Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
^ 100 105 110 

OgTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 
JVal Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
1 115 120 125 

= "CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432 
..Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
i 130 135 140 

-GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480 
JVal Aso Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
;i45 * 150 155 160 

1CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528 
;| Gln Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 57 6 

Met Ala Trp Aso Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

GTA TCG CAG CTG CTC CGG ATC CTC TAATAG 606 
Val Ser Gin Leu Leu Arg He Leu 
195 200 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



-•p\ r=v> f - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

rJ/al Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
2 115 120 125 

ELeu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
y 130 135 140 

Ij/al Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
A4 5 150 155 160 

% bln Aso Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 

L * 165 170 175 

JMet Ala Trp Aso Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
U 180 185 190 



3/al Ser Gin Leu Leu Arg He Leu 
- 195 200 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..633 



(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 1..630 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 
Met Leu Gly Lys Val lie Asd Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 * 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 
Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
' 130 135 140 

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 
Val Asd Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
145 * 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 
Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 
Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

GTA TCG CAG CTG CTC CGG ATC GTG ATC GAG GGC AGA CAC CAT CAC CAC 
Val Ser Gin Leu Leu Arg He Val He Glu Gly Arg His His His His 
195 200 205 



CAT CAC TAATAG 
His His 



210 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr lie Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

HThr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 

IB 50 55 60 

2-Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
[J 65 70 75 80 

Nser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 

if] 85 90 95 

"Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 

'U 100 105 110 

y %al Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

ilLeu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
y 130 135 140 

Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

Gin Asd Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

Met Ala Trp Aso Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

Val Ser Gin Leu Leu Arg He Val He Glu Gly Arg His His His His 
195 200 205 

His His 
210 



{2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..627 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..624 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATG GOT AAG GTC ATC GAT ACC CTT ACG TGC GGA TTC GCC GAT CTC ATG 
Met Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 
15 10 15 

■:GGG TAC ATC CCG CTC GTC GGC GCT CCC GTA GGA GGC GTC GCA AGA GCC 
Gly Tyr He Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala 
20 25 30 

CTT GCG CAT GGC GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACA 
Leu Ala His Gly Val Arg Ala Leu Glu Asp Gly He Asn Phe Ala Thr 
35 40 45 

GGG AAT TTG CCC GGT TGC TCC TTT TCT ATT TTC CTT CTC GCT CTG TTC 
Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Phe 
50 55 60 

TCT TGC TTA ATT CAT CCA GCA GCT AGT CTA GAG TGG CGG AAT ACG TCT 
Ser Cys Leu He His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser 
65 70 75 80 

GGC CTC TAT GTC CTT ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAC 
Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr 
85 90 95 

GAG GCC GAT GAC GTT ATT CTG CAC ACA CCC GGC TGC ATA CCT TGT GTC 
Glu Ala Asp Asp Val He Leu His Thr Pro Gly Cys He Pro Cys Val 
100 105 110 

CAG GAC GGC AAT ACA TCC ACG TGC TGG ACC CCA GTG ACA CCT ACA GTG 
Gin Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val 
115 120 125 

GCA GTC AAG TAC GTC GGA GCA ACC ACC GCT TCG ATA CGC AGT CAT GTG 
Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser He Arg Ser His Val 
130 135 140 



GAC CTA TTA GTG GGC GCG GCC ACG ATG TGC TCT GCG CTC TAC GTG GGT 
Asp Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly 
145 150 155 160 



GAC ATG TGT GGG GCT GTC TTC CTC GTG GGA CAA GCC TTC ACG TTC AGA 528 

Asp Met Cys Gly Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg 
165 170 175 

CCT CGT CGC CAT CAA ACG GTC CAG ACC TGT AAC TGC TCG CTG TAG CCA 576 

Pro Arg Arg His Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro 
180 185 190 

GGC CAT CTT TCA GGA CAT CGA ATG GCT TGG GAT ATG ATG ATG AAC TGG 624 

Gly His Leu Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 

195 200 205 

TAATAG 634 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

\ (ii) MOLECULE TYPE: protein 

J (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

p Met Gly Lys Val lie Asd Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 
j 1 5 10 15 

I Gly Tyr He Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala 
20 25 30 

: Leu Ala His Gly Val Arg Ala Leu Glu Asp Gly He Asn Phe Ala Thr 
35 40 45 

1 Gly Asn Leu Pro Gly Cvs Ser Phe Ser He Phe Leu Leu Ala Leu Phe 
50 ~ 55 60 

= Ser Cys Leu He His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser 
65 70 75 80 

Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr 
85 90 95 

Glu Ala Asp Asp Val He Leu His Thr Pro Gly Cys He Pro Cys Val 
100 105 110 

Gin Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val 
115 120 125 

Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser He Arg Ser His Val 
130 135 140 

Asp Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly 
145 150 155 160 

Asp Met Cys Gly Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg 
165 170 175 



Pro Arg Arg His Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro 
180 185 190 

Gly His Leu Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

"\i (ix) FEATURE: 

I~ (A) NAME /KEY: CDS 

]j£ (B) LOCATION: 1. .627 

%! (ix) FEATURE: 

m (A) NAME /KEY : mat_peptide 

Cj (3) LOCATION: 1..624 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

HVTG GGT AAG GTC ATC GAT ACC CTA ACG TGC GGA TTC GCC GAT CTC ATG 48 

^Met Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 

yg 15 10 15 

'%GG TAT ATC CCG CTC GTA GGC GGC CCC ATT GGG GGC GTC GCA AGG GCT 96 
Gly Tyr He Pro Leu Val Gly Gly Pro He Gly Gly Val Ala Arg Ala 
20 25 30 

CTC GCA CAC GGT GTG AGG GTC CTT GAG GAC GGG GTA AAC TAT GCA ACA 144 
Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr 
35 40 45 

GGG AAT TTA CCC GGT TGC TCT TTC TCT ATC TTT ATT CTT GCT CTT CTC 192 
Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe He Leu Ala Leu Leu 
50 55 60 

TCG TGT CTG ACC GTT CCG GCC TCT GCA GTT CCC TAC CGA AAT GCC TCT 240 
Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser 
65 70 75 80 

GGG ATT TAT CAT GTT ACC AAT GAT TGC CCA AAC TCT TCC ATA GTC TAT 288 
Gly He Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser He Val Tyr 
85 90 95 

GAG GCA GAT AAC CTG ATC CTA CAC GCA CCT GGT TGC GTG CCT TGT GTC 336 



Glu Ala Asp Asn Leu He Leu His Ala Pro Gly Cys Val Pro Cys Val 
100 105 110 

ATG ACA GGT AAT GTG AGT AGA TGC TGG GTC CAA ATT ACC CCT ACA CTG 384 
Met Thr Gly Asn Val Ser Arg Cys Trp Val Gin He Thr Pro Thr Leu 
115 120 125 

TCA GCC CCG AGC CTC GGA GCA GTC ACG GCT CCT CTT CGG AGA GCC GTT 432 
Ser Ala Pro Ser Leu Gly Ala Val Thr Ala Pro Leu Arg Arg Ala Val 
130 135 140 

GAC TAC CTA GCG GGA GGG GCT GCC CTC TGC TCC GCG TTA TAC GTA GGA 480 
Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys Ser Ala Leu Tyr Val Gly 
145 150 155 160 

GAC GCG TGT GGG GCA CTA TTC TTG GTA GGC CAA ATG TTC ACC TAT AGG 528 
Asp Ala Cys Gly Ala Leu Phe Leu Val Gly Gin Met Phe Thr Tyr Arg 
165 170 175 

CCT CGC CAG CAC GCT ACG GTG CAG AAC TGC AAC TGT TCC ATT TAC AGT 576 
Pro Arg Gin His Ala Thr Val Gin Asn Cys Asn Cys Ser He Tyr Ser 
180 185 190 

p€GC CAT GTT ACC GGC CAC CGG ATG GCA TGG GAT ATG ATG ATG AAC TGG 624 
!:;Gly His Val Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
y f 195 200 205 



If AAT AG ^30 



"(2) INFORMATION FOR SEQ ID NO: 32: 

3 (i) SEQUENCE CHARACTERISTICS: 

0 (A) LENGTH: 208 amino acids 
(B) TYPE: amino acid 

y (D) TOPOLOGY: linear 

1 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 
15 10 15 

Gly Tyr He Pro Leu Val Gly Gly Pro lie Gly Gly Val Ala Arg Ala 
20 25 30 

Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr 
35 40 45 

Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe He Leu Ala Leu Leu 
50 55 60 

Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser 
65 70 75 80 

Gly He Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser He Val Tyr 
85 90 95 



Glu Ala Asp Asn Leu He Leu His Ala Pro Gly Cys Val Pro Cys Val 
100 105 110 

Met Thr Gly Asn Val Ser Arg Cys Trp Val Gin He Thr Pro Thr Leu 
115 120 125 

Ser Ala Pro Ser Leu Gly Ala Val Thr Ala Pro Leu Arg Arg Ala Val 
130 135 140 

Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys Ser Ala Leu Tyr Val Gly 
145 150 155 160 

Asp Ala Cys Gly Ala Leu Phe Leu Val Gly Gin Met Phe Thr Tyr Arg 
165 170 175 

Pro Arg Gin His Ala Thr Val Gin Asn Cys Asn Cys Ser He Tyr Ser 
130 185 190 

Gly His Val Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
195 200 205 



■\{2) INFORMATION FOR SEQ ID NO: 33: 

J (i) SEQUENCE CHARACTERISTICS: 

J (A) LENGTH: 23 base pairs 

-J (B) TYPE: nucleic acid 

1 (C) STRANDEDNESS : single 

j (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

} (iii) HYPOTHETICAL: NO 

J (iii) ANTI-SENSE: NO 



' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TGGGATATGA TGATGAACTG GTC 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



CTATTATGGT GGTAAGCCAC AGAGCAGGAG 30 
{2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1473 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 
rf (B) LOCATION: 1..1470 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



^GG 


GAT 


ATG 


ATG 


ATG 


AAC 


TGG 


TCG 


CCT 


ACA 


ACG 


GCC 


CTG GTG GTA TCG 


48 


IS rp 


Asp 


Met 


Met 


Met 


Asn 


Trp 


Ser 


Pro 


Thr 


Thr 


Ala 


Leu Val Val Ser 




Cj 1 








5 










10 






15 




:cag 


CTG 


CTC 


CGG 


ATC 


CCA 


CAA 


GCT 


GTC 


GTG 


GAC 


ATG 


GTG GCG GGG GCC 


96 


Liln 


Leu 


Leu 


Arg 


He 


Pro 


Gin 


Ala 


Val 


Val 


Asp 


Met 


Val Ala Gly Ala 










20 










25 








30 




fCAT 


TGG 


GGA 


GTC 


CTG 


GCG 


GGC 


CTC 


GCC 


TAC 


TAT 


TCC 


ATG GTG GGG AAC 


144 


Mis 


Trp 


Gly 


Val 


Leu 


Ala 


Gly 


Leu 


Ala 


Tyr 


Tyr 


Ser 


Met Val Gly Asn 








35 










40 










45 




^GG 


GCT 


AAG 


GTT 


TTG 


GTT 


GTG 


ATG 


CTA 


CTC 


TTT 


GCC 


GGC GTC GAC GGG 


192 


Trp 


Ala 


Lys 


Val 


Leu 


Val 


Val 


Met 


Leu 


Leu 


Phe 


Ala 


Gly Val Asp Gly 






50 










55 










60 






CAT 


ACC 


CGC 


GTG 


TCA 


GGA 


GGG 


GCA 


GCA 


GCC 


TCC 


GAT 


ACC AGG GGC CTT 


240 


His 


Thr 


Arg 


Val 


Ser 


Gly 


Gly 


Ala 


Ala 


Ala 


Ser 


Asp 


Thr Arg Gly Leu 




65 










70 










75 




80 




GTG 


TCC 


CTC 


TTT 


AGC 


CCC 


GGG 


TCG 


GCT 


CAG 


AAA 


ATC 


CAG CTC GTA AAC 


288 


Val 


Ser 


Leu 


Phe 


Ser 


Pro 


Gly 


Ser 


Ala 


Gin 


Lys 


He 


Gin Leu Val Asn 












85 










90 






95 




ACC 


AAC 


GGC 


AGT 


TGG 


CAC 


ATC 


AAC 


AGG 


ACT 


GCC 


CTG 


AAC TGC AAC GAC 


•336 


Thr 


Asn 


Gly 


Ser 


Trp 


His 


He 


Asn 


Arg 


Thr 


Ala 


Leu 


Asn Cys Asn Asp 










100 










105 








110 




TCC 


CTC 


CAA 


ACA 


GGG 


TTC 


TTT 


GCC 


GCA 


CTA 


TTC 


TAC 


AAA CAC AAA TTC 


384 


Ser 


Leu 


Gin 


Thr 


Gly 


Phe 


Phe 


Ala 


Ala 


Leu 


Phe 


Tyr 


Lys His Lys Phe 








115 










120 










125 




AAC 


TCG 


TCT 


GGA 


TGC 


CCA 


GAG 


CGC 


TTG 


GCC 


AGC 


TGT 


CGC TCC ATC GAC 


432 




Asn 


Ser 


Ser 


Gly 


Cys 


Pro 


Glu 


Arg 


Leu 


Ala 


Ser 


Cys 


Arg 


Ser 


He 


Asp 






130 










135 










140 












AAG 


TTC 


GCT 


CAG 


GGG 


TGG 


GGT 


bbb 


bib 


ACT 


TAC 


ACT 


CAC 


rr*v 
CCi 


AAL 


ACC 


a q n 
4 o U 


Lys 


Phe 


Ala 


Gin 


Gly 


Trp 


Gly 


Pro 


Leu 


Thr 


Tyr 


Thr 


Glu 


Pro 


Asn 


Ser 




145 










150 










155 










160 




TCG 


GAC 


CAG 


AGG 


ccc 


TAC 


TGC 


TGG 


CAC 


TAC 


GCG 


CCi 


CCA 


rrr 
CCC 


TCT 


GCT 




Ser 


Asp 


Gin 


Arg 


Pro 


Tyr 


Cys 


Trp 


His 


Tyr 


Ala 


Pro 


Arg 


Pro 


Cys 


Gly 












165 










170 










175 






ATT 


GTA 


CCC 


CCC 


ICi 


/""TV/** 

CAC 


bib 


ibb 


bbi 


bbA 


CI b 


TAT 
1 Ai 


iLL 


TTr* 
lie 


Arr 
ACC 


err 
CCC 


^7 £ 
3/0 


He 


Val 


Pro 


Ala 


Ser 


Gin 


Val 


Cys 


Gly 


Pro 


Val 


Tyr 


Cys 


Phe 


Thr 


Pro 










180 










185 










190 








ACC 


CCi 


/"■mm 
Ul 1 


bib 


bib 


bbb 


Abb 


Abb 


rail 

CAi 


rrr 

CCC 


mmm 

ill 


V3V3 1 


UiL 


rrr 


ALL 


TUT 
1 AI 


OZ *» 


Ser 


Pro 


Val 


Val 


Val 


Gly 


Thr 


Thr 


Asp 


Arg 


Phe 


Gly 


Val 


Pro 


Thr 


Tyr 








195 










200 










205 










AAC 


TGC 


bbb 


bbb 


AAC 


bAb 


Ibb 


CA1 


err 

bib 


Lib 


B.TT 
Ai 1 


Li L 


a zip 

nAL 


a ar 

AAL 


ACC 
ALL 


err 
CCC 


£7 9 
0 / Z 


Asn 


Trp 


Gly 


Ala 


Asn 


Asp 


Ser 


Asp 


Val 


Leu 


He 


Leu 


Asn 


Asn 


Thr 


Arg 






210 










215 










220 












fCCG 


CCG 


CGA 


GGC 


AAC 


TGG 


TTC 


GGC 


TGT 


ACA 


TGG 


7\ rp /-* 

ATG 


AAT 


GGC 


ACT 


GGG 


7ZU 


:3>ro 


Pro 


Arg 


Gly 


Asn 


Trp 


Phe 


Gly 


Cys 


Thr 


TrD 


Met 


Asn 


Gly 


Thr 


Gly 




y |25 










230 










235 










240 




flfTC 


ACC 


AAG 


ACG 


TGT 


GGG 


GGC 


CCC 


CCG 


TGC 


AAC 


ATC 


GGG 


GvjG 


GCC 


GGC 


7 bo 


EPhe 


Thr 


Lys 


Thr 


Cys 


Gly 


Gly 


Pro 


Pro 


Cys 


Asn 


lie 


Ciy 


ciy 


Ala 


Cj.y 












A A C 

245 










OCA 










ZOO 






:;'£ac 


AAC 


ACC 


TTG 


ACC 


TGC 


CCC 


ACT 


GAC 


TGT 


TTT 


CGG 


AAG 


CAC 


CCC 


GAG 


816 


"Asn 


Asn 


Thr 


Leu 


Thr 


Cys 


Pro 


Thr 


Asp 


Cys 


Phe 


Arg 


Lys 


His 


Pro 


Glu 










260 










o c c 
zoo 










o *7 n 








- C r^r* 


ACC 


1 AC 


bCC 


ACA 


Ibb 


bbi 


ICi 


err 
CCC 


rrr 
CCC 


ICC 


c 1C 


ACA 


LL 1 


ACva 


TPT 
1C 1 






Thr 


Tyr 


Ala 


Arg 


Cys 


Gly 


Ser 


Gly 


Pro 


Trp 


Leu 


Thr 


Pro 


Arg 


Cys 








275 










280 










285 










. fATC 


Ci i 


/"» 7\ rp 

CA1 




CCA 


Tfl T 
1A1 


Abb 


bib 


i uo 


CAC 


i AC 


rrr 

LLL 


iuL 


AL 1 


rrpp 
uiL 


AAC 
AAL 


Ql 9 




Val 


His 


Tyr 


Pro 


Tyr 


Arg 


Leu 


Trp 


His 


Tyr 


Pro 


Cys 


Thr 


Val 


Asn 






290 










295 










300 












TTC 


ACC 


ATC 


TTC 


AAG 


GTT 


AGG 


ATG 


TAC 


GTG 


GGG 


GGC 


CTC 


/"* TV /**■ 

GAG 


CAC 


AGG 


Q £ f\ 


Phe 


Thr 


He 


Phe 


Lys 


Val 


Arg 


Met 


Tyr 


Val 


Gly 


Gly 


Val 


Glu 


His 


Arg 




305 










310 










315 










320 




TTC 


GAA 


GCC 


GCA 


TGC 


AAT 


TGG 


ACT 


CGA 


GGA 


GAG 


CGT 


TGT 


GAC 


TTG 


GAG 


T A A a 

1008 


Phe 


Glu 


Ala 


Ala 


Cys 


Asn 


Trp 


Thr 


Arg 


Gly 


Glu 


Arg 


Cys 


Asp 


Leu 


Glu 












325 










330 










335 






GAC 


AGG 


GAT 


AGA 


TCA 


GAG 


CTT 


AGC 


CCG 


CTG 


CTG 


CTG 


TCT 


ACA 


ACA 


GAG 


1056 


Asp 


Arg 


Asp 


Arg 


Ser 


Glu 


Leu 


Ser 


Pro 


Leu 


Leu 


Leu 


Ser 


Thr 


Thr 


Glu 










340 










345 










350 








TGG 


CAG 


ATA 


CTG 


CCC 


TGT 


TCC 


TTC 


ACC 


ACC 


CTG 


CCG 


GCC 


CTA 


TCC 


ACC 


"1 T A A 

1104 


Trp 


Gin 


He 


Leu 


Pro 


Cys 


Ser 


Phe 


Thr 


Thr 


Leu 


Pro 


Ala 


Leu 


Ser 


Thr 






355 








360 










365 










GGC 


CTG 


ATC 


CAC 


CTC 


CAT 


CAG 


AAC 


ATC 


GTG 


GAC 


GTG 


CAA 


TAC 


CTG 


TAC 


1152 


Gly 


Leu 


He 


His 


Leu 


His 


Gin 


Asn 


He 


Val 


Asp 


Val 


Gin 


Tyr 


Leu 


Tyr 




370 










375 










380 













GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC 1200 
Gly Val Gly Ser Ala Val Val Ser Leu Val He Lys Trp Glu Tyr Val 
385 390 395 400 

CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA 1248 
Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu 
405 410 415 

TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG 1296 
Trp Met Met Leu Leu He Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu 
420 425 430 

GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC 1344 
Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser 
435 440 445 

TTC CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC 1392 
Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr He Lys Gly Arg Leu Val 
450 455 460 

CCT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT 14 40 

Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu 
^465 470 475 480 

%TG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 1476 
yQteu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
fy 485 490 

C j ( 2 ) INFORMATION FOR SEQ ID NO: 36: 

;- : : (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 90 amino acids 
s (B) TYPE: amino acid 

Q (D) TOPOLOGY: linear 

[1 (ii) MOLECULE TYPE: protein 

! H (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



Nbrp Asd Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser 
.1 5 10 15 

Gin Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala 
20 25 30 

His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn 
35 40 45 

Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly 
50 55 60 

His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu 
65 70 75 80 

Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn 
85 90 95 

Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp 
100 105 110 



HCVENV-T 



Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe 
115 120 125 

Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp 
130 135 140 

Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser 
145 150 155 160 

Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly 
165 170 175 

He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro 
180 185 190 

Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr 
195 200 205 

Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg 
210 215 220 

Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly 
225 230 235 240 

::1phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly 

2 245 250 255 

Iksn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu 
D 260 265 270 

•^Aia Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys 
I 275 280 235 

Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn 

3 290 295 300 

=*Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg 
||305 310 315 320 

Hphe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu 
^ 325 330 335 

Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu 
340 345 350 

Trp Gin He Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr 
355 360 365 

Gly Leu He His Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr 
370 375 380 

Gly Val Gly Ser Ala Val Val Ser Leu Val He Lys Trp Glu Tyr Val 
385 390 395 400 

Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu 
405 410 415 

Trp Met Met Leu Leu He Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu 
420 425 430 

Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser 



HCVENV.-T 



435 440 445 

Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr He Lys Gly Arg Leu Val 
450 455 460 

Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu 
465 470 475 480 

Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
485 490 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
, (iii) ANTI-SENSE: NO 

i (ix) FEATURE: 

j (A) NAME /KEY: CDS 

| (B) LOCATION:. 2. .1018 

t (ix) FEATURE: 

! (A) NAME /KEY : mat_peptide 

= (B) LOCATION: 2.. 1015 

\ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

\ G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 4 6 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly 
j 1 5 10 15 

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94 
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys 
20 25 30 

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 142 
Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg 
35 40 45 

GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190 
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu 
50 55 60 

TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC 238 
Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly 
65 70 75 

AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA 28 6 

Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin 
80 85 90 95 



ACA GGG TTC TTT GCC GCA CTA TTC TAG AAA CAC AAA TTC AAC TCG TCT 334 
Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser 
100 105 110 

GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT 382 
Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys Phe Ala 
115 120 125 

CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG GAC CAG 430 
Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin 
130 135 140 

AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT GTA CCC 478 
Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro 
145 150 155 

GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC CCT GTT 526 
Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val 
160 165 170 175 

GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC TGG GGG 574 
Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly 
180 185 190 

;:jGCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG CCG CGA 622 
i; :';Ala Asn Aso Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg 
^ " 195 * 200 205 

?|GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC ACC AAG 670 
-JGly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys 
IP 210 215 220 

7 %CG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC AAC ACC 718 

Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn Asn Thr 
□ 225 230 235 

ClTTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC ACC TAC 7 66 

r^Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr 
;^24G 245 250 255 

~^GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG GTT CAT 814 
Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His 
260 265 270 

TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC ACC ATC 862 
Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr He 
275 280 285 

TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC GAA GCC 910 
Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala 
290 295 300 

GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT 958 
Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp 
305 310 315 

AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG CAG AGT 1006 
Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Ser 
320 325 330 335 

GGC AGA GCT TAATTA 1021 



Gly Arg Ala 



{2} INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
15 10 15 

Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 
20 25 30 

Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val 
35 40 45 

Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe 
50 55 60 

:Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser 
65 70 75 80 

Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr 
85 90 95 

Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly 
100 105 110 

Cys Pro GIu Arg Leu Ala Ser Cys Arg Ser He Asp Lys Phe Ala Gin 
115 120 125 

Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg 
130 135 140 

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala 
145 150 155 160 

Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
165 170 175 

Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala 
180 185 190 

Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg Gly 
195 200 205 

Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr 
210 215 220 

Cys Gly Gly Pro Pro Cys Asn lie Gly Gly Ala Gly Asn Asn Thr Leu 
225 230 235 240 

Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr 'Ala 
245 250 255 



Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr 
260 265 270 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr lie Phe 
275 280 285 

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala 
290 295 300 

Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305 310 315 320 

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Ser Gly 
325 330 335 

Arg Ala 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 base pairs 
* (3) TYPE: nucleic acid 

t (C) STRANDEDNESS: single 

1 (D) TOPOLOGY: linear 

J (ii) MOLECULE TYPE: cDNA 

j (iii) HYPOTHETICAL: NO 

: (iii) ANTI-SENSE: NO 



I (ix) FEATURE: 

I (A) NAME /KEY: CDS 

i (3) LOCATION: 2.. 1032 

! (ix) FEATURE: 

\ (A) NAME /KEY: mat_peptide 

^ (B) LOCATION: 2.. 1029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 4 6 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly 
15 10 15 

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94 
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys 
20 25 30 

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 142 
Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg 
35 40 45 

GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190 
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu 
50 55 60 



HCVENV T 



TTT AGC CCC GGG TCG GCT 
Phe Ser Pro Gly Ser Ala 
65 

AGT TGG CAC ATC AAC AGG 
Ser Trp His He Asn Arg 
80 85 

ACA GGG TTC TTT GCC GCA 
Thr Gly Phe Phe Ala Ala 
100 

GGA TGC CCA GAG CGC TTG 
Gly Cys Pro Glu Arg Leu 
115 

CAG GGG TGG GGT CCC CTC 
Gin Gly Trp Gly Pro Leu 
130 

AGG CCC TAC TGC TGG CAC 
Arg Pro Tyr Cys Trp His 
145 

ZJGCG TCT CAG GTG TGC GGT 
?Ala Ser Gin Val Cys Gly 
>LSQ 165 

JGTG GTG GGG ACG ACC GAT 
JVal Val Glv Thr Thr Aso 

i 180 

"GCG AAC GAC TCG GAT GTG 

. Ala Asn Asp Ser Asp Val 
195 

iGGC AAC TGG TTC GGC TGT 

iGly Asn Trp Phe Gly Cys 
210 

J ACG TGT GGG GGC CCC CCG 

Thr Cys Gly Gly Pro Pro 
225 



TTG 


ACC 


TGC 


CCC 


ACT 


GAC 


Leu 


Thr 


Cys 


Pro 


Thr 


Asp 


240 










245 


GCC 


AGA 


TGC 


GGT 


TCT 


GGG 


Ala 


Arg 


Cys 


Gly 


Ser 


Gly 










260 




TAC 


CCA 


TAT 


AGG 


CTC 


TGG 


Tyr 


Pro 


Tyr 


Arg 


Leu 


Trp 








275 






TTC 


AAG 


GTT 


AGG 


ATG 


TAC 


Phe 


Lys 


Val 


Arg 


Met 


Tyr 






290 








GCA 


TGC 


AAT 


TGG 


ACT 


CGA 


Ala 


Cys 


Asn 


Trp 


Thr 


Arg 




CAG 
Gin 
70 


AAA 
Lys 


ATC 
He 


CAG 
Gin 


CTC 
Leu 


GTA 
Val 
75 


ACT 
Thr 


GCC 
Ala 


CTG 
Leu 


AAC 
Asn 


TGC 
Cys 
90 


AAC 
Asn 


CTA 
Leu 


TTC 
Phe 


TAC 
Tyr 


AAA 
Lys 
105 


CAC 
His 


AAA 
Lys 


GCC 
Ala 


AGC 
Ser 


TGT 
Cys 
120 


CGC 
Arg 


TCC 
Ser 


ATC 
He 


ACT 
Thr 


TAC 
Tyr 
135 


ACT 
Thr 


GAG 
Glu 


CCT 
Pro 


AAC 
Asn 


TAC 
Tyr 
150 


GCG 
Ala 


CCT 
Pro 


CGA 
Arg 


CCG 
Pro 


TGT 
Cys 
155 


CCA 
Pro 


GTG 
Val 


TAT 
Tyr 


TGC 
Cys 


TTC 
Phe 
170 


ACC 
Thr 


CGG 
Arg 


TTT 
Phe 


GGT 
Gly 


GTC 
Val 
185 


CCC 
Pro 


ACG 
Thr 


CTG 
Leu 


ATT 
He 


CTC 
Leu 
200 


AAC 
Asn 


AAC 
Asn 


ACG 
Thr 


ACA 
Thr 


TGG 
Trp 
215 


ATG 
Met 


AAT 
Asn 


GGC 
Gly 


ACT 
Thr 


TGC 
Cys 
230 


AAC 
Asn 


ATC 
He 


GGG 
Gly 


GGG 
Gly 


GCC 
Ala 
235 


TGT 
Cys 

CCC 
Pro 


TTT 
Phe 

TGG 
Trp 


CGG 
Arg 

CTG 
Leu 


AAG 
Lys 

ACA 
Thr 
265 


CAC 
His 
250 
CCT 
Pro 


CCC 
Pro 

AGG 
Arg 


CAC 
His 


TAC 
Tyr 


CCC 
Pro 
280 


TGC 
Cys 


ACT 
Thr 


GTC 
Val 


GTG 
Val 


GGG 
Gly 
295 


GGC 
Gly 


GTb 

Val 


GAG 
Glu 


CAC 
His 


GGA 
Gly 


GAG 
Glu 


CGT 
Arg 


TGT 
Cys 


GAC 
Asp 


TTG 
Leu 




AAC ACC AAC GGC 238 
Asn Thr Asn Gly 



GAC TCC CTC CAA 28 6 

Asp Ser Leu Gin 
95 

TTC AAC TCG TCT 334 
Phe Asn Ser Ser 
110 

GAC AAG TTC GCT 382 
Asp Lys Phe Ala 
125 

AGC TCG GAC CAG 430 

Ser Ser Asp Gin 

140 

GGT ATT GTA CCC 478 
Gly lie Val Pro 



CCG AGC CCT GTT 526 
Pro Ser Pro Val 
175 

TAT AAC TGG GGG 574 
Tyr Asn Trp Gly 
190 

CGG CCG CCG CGA 622 
Arg Pro Pro Arg 
205 

GGG TTC ACC AAG 670 

Gly Phe Thr Lys 

220 

GGC AAC AAC ACC 718 
Gly Asn Asn Thr 



GAG GCC ACC TAC 766 
Glu Ala Thr Tyr 
255 

TGT ATG GTT CAT 814 
Cys Met Val His 
270 

AAC TTC ACC ATC 862 
Asn Phe Thr He 
235 

AGG TTC GAA GCC 910 

Arg Phe Glu Ala 

300 

GAG GAC AGG GAT 958 
Glu Asp Arg Asp 



305 310 315 

AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GGT GAT CGA GGG 1006 
Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gly Asp Arg Gly 
320 325 330 335 

CAG ACA CCA TCA CCA CCA TCA CTA AT AG 1034 
Gin Thr Pro Ser Pro Pro Ser Leu 
340 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
^ 15 10 15 

Bleu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Tro Ala Lys Val 
m 20 25 ' 30 

rLeu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val 
y 35 40 45 

Njer Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe 
50 55 60 

tier pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser 

y;| 65 70 75 80 

-Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr 
't 85 90 95 

;: 4ly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly 
100 105 110 

Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys Phe Ala Gin 
115 120 125 

Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg 
130 135 140 

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala 
145 150 155 160 

Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
165 170 175 

Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala 
180 185 190 

Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg Gly 
195 200 205 



Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr 
210 215 220 

Cys Gly Gly Pro Pro Cys Asn lie Gly Gly Ala Gly Asn Asn Thr Leu 
225 230 235 240 

Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala 
245 250 255 

Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr 
260 265 270 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr He Phe 
275 280 285 

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala 
290 295 300 

Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305 310 315 320 

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gly Asp Arg Gly Gin 
325 330 335 



:Thr Pro Ser 



Pro Pro 
340 



Ser Leu 



03(2) INFORMATION FOR SEQ ID NO: 41: 

■2 (i) SEQUENCE CHARACTERISTICS: 

1 1 (A) LENGTH: 945 base pairs 

' H (B) TYPE: nucleic acid 

* (C) STRANDEDNESS : single 

O (D) TOPOLOGY: linear 

yl (ii) MOLECULE TYPE: cDNA 

Fy (iii) HYPOTHETICAL: NO 

H (iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..942 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1..939 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 48 
Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
15 10 15 

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96 
Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
20 25 30 



vn rrvy-T 



ACC AGG 
Thr Arg 



CAG CTC 
Gin Lea 
50 

AAC TGC 
Asn Cys 
65 

AAA CAC 
Lys His 



CGC TCC 
Arg Ser 



GAG CCT 
Glu Pro 



:;jCGA CCG 
%rg Pro 
0 130 



GGC CTT 
Gly Leu 
35 

GTA AAC 
Val Asn 



AAC GAC 
Asn Asp 

AAA TTC 
Lys Phe 



ATC GAC 
lie Asp 
100 

AAC AGC 
Asn Ser 
115 

TGT GGT 
Cys Gly 



GTG TCC 
Val Ser 



ACC AAC 
Thr Asn 



JTGC TTC 
JCys Phe 
1145 

:| GTC CCC 
Val Pro 



r=AAC AAC 
=Asn Asn 



^ AAT GGC 
Asn Gly 



ACC CCG 
Thr Pro 



ACG TAT 
Thr Tyr 



GGG GGG 
Gly Gly 
210 

AAG CAC 
Lys His 
225 

ACA CCT 
Thr Pro 



ACG CGG 
Thr Arg 
180 

ACT GGG 
Thr Gly 
195 

GCC GGC 
Ala Gly 



TCC CTC 
Ser Leu 
70 

AAC TCG 
Asn Ser 
85 

AAG TTC 
Lys Phe 



TCG GAC 
Ser Asp 



ATT GTA 
lie Val 



AGC CCT 
Ser Pro 
150 

AAC TGG 
Asn Trp 
165 

CCG CCG 
Pro Pro 



CTC TTT AGC 
Leu Phe Ser 
40 

GGC AGT TGG 
Gly Ser Trp 
55 

CAA ACA GGG 
Gin Thr Gly 



TCT GGA TGC 
Ser Gly Cys 



GCT CAG GGG 
Ala Gin Gly 
105 

CAG AGG CCC 
Gin Arg Pro 
120 

CCC GCG TCT 
Pro Ala Ser 
135 

GTT GTG GTG 
Val Val Val 



GGG GCG AAC 
Gly Ala Asn 



CCC GGG TCG 
Pro Gly Ser 



CAC ATC AAC 
His lie Asn 
60 

TTC TTT GCC 
Phe Phe Ala 
75 

CCA GAG CGC 
Pro Glu Arg 
90 

TGG GGT CCC 
Trp Gly Pro 



TAC TGC TGG 
Tyr Cys Trp 



CCC GAG 
Pro Glu 



AGG TGT 
Arg Cys 



TTC ACC 
Phe Thr 



AAC AAC 
Asn Asn 



GCC ACC 
Ala Thr 
230 

ATG GTT 
Met Val 
245 



CGA GGC AAC 
Arg Gly Asn 
185 

AAG ACG TGT 
Lys Thr Cys 
200 

ACC TTG ACC 
Thr Leu Thr 
215 

TAC GCC AGA 
Tyr Ala Arg 



CAT TAC CCA 
His Tyr Pro 



CAG GTG TGC 
Gin Val Cys 
140 

GGG ACG ACC 
Gly Thr Thr 
155 

GAC TCG GAT 
Asp Ser Asp 
170 

TGG TTC GGC 
Trp Phe Gly 



GCT CAG AAA ATC 
Ala Gin Lys lie 
45 

AGG ACT GCC CTG 
Arg Thr Ala Leu 



GCA CTA TTC TAC 
Ala Leu Phe Tyr 
80 

TTG GCC AGC TGT 
Leu Ala Ser Cys 
95 

CTC ACT TAC ACT 
Leu Thr Tyr Thr 
110 

CAC TAC GCG CCT 
His Tyr Ala Pro 
125 

GGT CCA GTG TAT 
Gly Pro Val Tyr 



GGG GGC CCC 
Gly Gly Pro 



TGC CCC ACT 
Cys Pro Thr 
220 

TGC GGT TCT 
Cys Gly Ser 
235 

TAT AGG CTC 
Tyr Arg Leu 
250 



GAT CGG TTT GGT 
Asp Arg Phe Gly 
160 

GTG CTG ATT CTC 
Val Leu lie Leu 
175 

TGT ACA TGG ATG 
Cys Thr Trp Met 
190 

CCG TGC AAC ATC 
Pro Cys Asn lie 
205 

GAC TGT TTT CGG 
Asp Cys Phe Arg 



GGG CCC TGG CTG 
Gly Pro Trp Leu 
240 

TGG CAC TAC CCC 
Trp His Tyr Pro 
255 



144 



192 



240 



238 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



TGC ACT 
Cys Thr 



GTC AAC 
Val Asn 
260 



TTC ACC 
Phe Thr 



ATC TTC AAG 
lie Phe Lys 
265 



GTT AGG ATG 
Val Arg Met 



TAC GTG GGG GGC 
Tyr Val Gly Gly 
270 



816 



GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 



864 



HCVENV-T 



Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG 912 
Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

TCT ACA ACA GAG TGG CAG AGC TTA ATT AAT TAG 945 
Ser Thr Thr Glu Trp Gin Ser Leu He Asn 
305 310 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

r" Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
p- 15 10 15 

■ '"bly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
± 20 25 30 

Nbhr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He 

ifl 35 40 45 

... "Gin Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu 
^ 50 55 60 

^Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
I- 65 " 70 75 80 

■fjLys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
Cj 85 90 95 

Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 

Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
130 135 140 

Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu 
165 170 175 

Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 185 190 



Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He 
195 200 205 



Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

Ser Thr Thr Glu Trp Gin Ser Leu lie Asn 
305 310 

(2) INFORMATION FOR SEQ ID NO: 43: 

£,! (i) SEQUENCE CHARACTERISTICS: 
y\ (A} LENGTH: 961 base pairs 

(B) TYPE: nucleic acid 
Yi (C) STRANDEDNESS: single 

^ (D) TOPOLOGY: linear 

lH (ii) MOLECULE TYPE: cDNA 

g " (iii) HYPOTHETICAL: NO 

*t (iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..958 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..955 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 48 
Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
1 5 10 15 

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96 
Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
20 25 30 

ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC 144 
Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He 
35 40 45 

CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG 192 



Gin Leu Val Asn 
50 



Thr Asn Gly 
55 



Ser Trp His 



lie Asn 
60 



Arg Thr Ala Leu 



AAC TGC AAC GAC 
Asn Cys Asn Asp 
65 

AAA CAC AAA TTC 
Lys His Lys Phe 



TCC CTC CAA 
Ser Leu Gin 
70 

AAC TCG TCT 
Asn Ser Ser 
85 



ACA GGG TTC 
Thr Gly Phe 



TTT GCC 
Phe Ala 
75 



GCA CTA TTC TAC 
Ala Leu Phe Tyr 
80 



GGA TGC CCA 
Gly Cys Pro 
90 



GAG CGC TTG GCC AGC TGT 
Glu Arg Leu Ala Ser Cys 
95 



240 



288 



CGC TCC ATC GAC 
Arg Ser lie Asp 
100 



AAG TTC GCT 
Lys Phe Ala 



CAG GGG TGG 
Gin Gly Trp 
105 



GGT CCC CTC ACT TAC ACT 
Gly Pro Leu Thr Tyr Thr 
110 



336 



GAG CCT AAC AGC 
Glu Pro Asn Ser 
115 



TCG GAC CAG 
Ser Asp Gin 



AGG CCC TAC 
Arg Pro Tyr 
120 



TGC TGG CAC TAC GCG CCT 
Cys Trp His Tyr Ala Pro 
125 



384 



CGA CCG TGT GGT 
Arg Pro Cys Gly 
130 

f TGC TTC ACC CCG 
-Cys Phe Thr Pro 
J 145 

|GTC CCC ACG TAT 
; Val Pro Thr Tyr 

1 AAC AAC ACG CGG 
Asn Asn Thr Arg 
180 



ATT GTA CCC 
He Val Pro 
135 

AGC CCT GTT 
Ser Pro Val 
150 

AAC TGG GGG 
Asn Trp Gly 
165 

CCG CCG CGA 
Pro Pro Arg 



GCG TCT CAG 
Ala Ser Gin 



GTG GTG GGG 
Vai Val Gly 



GTG TGC GGT CCA GTG TAT 4 32 

Val Cys Gly Pro Val Tyr 
140 

ACG ACC GAT CGG TTT GGT 480 
Thr Thr Asp Arg Phe Gly 
155 160 

GCG AAC GAC TCG GAT GTG CTG ATT CTC 528 
Ala Asn Asp Ser Asd Val Leu He Leu 
170 * 175 

GGC AAC TGG TTC GGC TGT ACA TGG ATG 57 6 

Gly Asn Trp Phe Gly Cys Thr Trp Met 
185 190 



AAT GGC ACT GGG 
Asn Gly Thr Gly 
195 



TTC ACC AAG 
Phe Thr Lys 



ACG TGT GGG 
Thr Cys Gly 
200 



GGC CCC CCG TGC AAC ATC 
Gly Pro Pro Cys Asn lie' 
205 



624 



GGG GGG GCC GGC 
Gly Gly Ala Gly 
210 

AAG CAC CCC GAG 
Lys His Pro Glu 
225 

ACA CCT AGG TGT 
Thr Pro Arg Cys 



AAC AAC ACC 
Asn Asn Thr 
215 

GCC ACC TAC 
Ala Thr Tyr 
230 

ATG GTT CAT 
Met Val His 
245 



TTG ACC TGC 
Leu Thr Cys 



GCC AGA TGC 
Ala Arg Cys 



TAC CCA TAT 
Tyr Pro Tyr 
250 



CCC ACT 
Pro Thr 
220 

GGT TCT 
Gly Ser 
235 



GAC TGT TTT CGG 
Asp Cys Phe Arg 



GGG CCC TGG CTG 
Gly Pro Trp Leu 
240 



AGG CTC TGG CAC TAC CCC 
Arg Leu Trp His Tyr Pro 
255 



672 



720 



768 



TGC ACT GTC AAC 
Cys Thr Val Asn 
260 



TTC ACC ATC 
Phe Thr He 



TTC AAG GTT 
Phe Lys Val 
265 



AGG ATG TAC GTG GGG GGC 
Arg Met Tyr Val Gly Gly 
270 



816 



GTG GAG CAC AGG 
Val Glu His Arg 
275 



TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 
Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
280 285 



864 



TGT GAC TTG GAG 
Cys Asp Leu Glu 



GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG 
Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 



912 



290 295 300 

TCT ACA ACA GGT GAT CGA GGG CAG ACA CCA TCA CCA CCA TCA CTA A 958 
Ser Thr Thr Gly Asp Arg Gly Gin Thr Pro Ser Pro Pro Ser Leu 
305 310 315 

TAG 961 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 319 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
15 10 15 

Gly Val Aso Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
□ 20 25 30 



mhr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He 
•rj 35 40 4 5 

IGln Leu Val Asn Thr Asn Gly Ser Trp His He Asn Ara Thr Ala Leu 
2 50 55 60 

J Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 

SLys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
*f 85 90 95 



HlArg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
y3 100 105 110 

Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
130 135 140 

Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu 
165 170 175 

Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 185 190 

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He 
195 200 205 

Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 

210 215 220 

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 



225 230 235 240 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

Ser Thr Thr Gly Asp Arg Gly Gin Thr Pro Ser Pro Pro Ser Leu 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1392 

(ix) FEATURE: 

(A) NAME/ KEY : mat_peptide 

(B) LOCATION: 1..1389 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



ATG 


GTG 


GCG 


GGG 


GCC 


CAT 


TGG 


GGA 


GTC 


CTG 


GCG 


GGC CTC GCC TAC TAT 


48 


Met 


Val 


Ala 


Gly 


Ala 


His 


Trp 


Gly 


Val 


Leu 


Ala 


Gly Leu Ala Tyr Tyr 




1 








5 










10 




15 




TCC 


ATG 


GTG 


GGG 


AAC 


TGG 


GCT 


AAG 


GTT 


TTG 


GTT 


GTG ATG CTA CTC TTT 


96 


Ser 


Met 


Val 


Gly 


Asn 


Trp 


Ala 


Lys 


Val 


Leu 


Val 


Val Met Leu Leu Phe 










20 










25 






30 




GCC 


GGC 


GTC 


GAC 


GGG 


CAT 


ACC 


CGC 


GTG 


TCA 


GGA 


GGG GCA GCA GCC TCC ' 


144 


Ala 


Gly Val 


Asp 


Gly 


His 


Thr 


Arg 


Val 


Ser 


Gly 


Gly Ala Ala Ala Ser 








35 










40 








45 




GAT 


ACC 


AGG 


GGC 


CTT 


GTG 


TCC 


CTC 


TTT 


AGC 


CCC 


GGG TCG GCT CAG AAA 


192 


Asp Thr Arg 


Gly 


Leu 


Val 


Ser 


Leu 


Phe 


Ser 


Pro 


Gly Ser Ala Gin Lys 






50 










55 










60 




ATC 


CAG 


CTC 


GTA 


AAC 


ACC 


AAC 


GGC 


AGT 


TGG 


CAC 


ATC AAC AGG ACT GCC 


240 


He 


Gin 


Leu 


Val 


Asn 


Thr 


Asn 


Gly 


Ser 


Trp 


His 


He Asn Arg Thr Ala 




65 










70 










75 


80 





CTG AAC TGC AAC GAG TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC 288 
Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe 
85 90 95 

TAC AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC 336 
Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser 
100 105 110 

TGT CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC 384 
Cys Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr 
115 120 125 

ACT GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG 432 
Thr Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala 
130 135 140 

CCT CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG 480 
Pro Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val 
145 150 155 160 

TAT TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT 528 
Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe 
3 165 170 175 

OGGT GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT 57 6 

llGly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu lie 

S 180 185 190 

t CTC AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG 624 

'5 Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp 

J 195 200 205 

3 ATG AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC 672 

IMet Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn 
I 210 215 220 



^ ATC GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT 720 
^Ile Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe 
^225 230 235 240 

CGG AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG 7 68 

Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp 
245 250 255 

CTG ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC 816 

Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr 

260 265 270 

CCC TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG 864 

Pro Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly 

275 280 285 

GGC GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG 912 
Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu 
290 295 300 

CGT TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG 960 
Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu 
305 310 315 320 

CTG TCT ACA ACA GAG TGG CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG 1008 



Leu Ser Thr Thr Glu Trp Gin He Leu Pro Cys Ser Phe Thr Thr Leu 
325 330 335 

CCG GCC CTA TCC ACC GGC CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC 1056 
Pro Ala Leu Ser Thr Gly Leu He His Leu His Gin Asn He Val Asp 
340 345 350 

GTG CAA TAC CTG TAC GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC 1104 
Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val He 
355 360 365 

AAA TGG GAG TAT GTC CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC 1152 
Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg 
370 375 380 

ATC TGC GCC TGC TTA TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC 1200 
He Cys Ala Cys Leu Trp Met Met Leu Leu He Ala Gin Ala Glu Ala 
385 390 395 400 

GCC TTA GAG AAC CTG GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG 124 8 

Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala 
405 410 415 

•Scat ggc act ctt tcc ttc ctt gtg ttc ttc tgt gct gcc tgg tac atc 1296 

OHis Gly Thr Leu Ser Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr He 
g 420 425 430 

iAAG GGC AGG CTG GTC CCT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG 1344 
':Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp 
% 435 440 445 

J CCG CTG CTC CTG CTT CTG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 1395 

Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
] 450 455 460 

^(2) INFORMATION FOR SEQ ID NO: 46: 

3 (i) SEQUENCE CHARACTERISTICS: 

j (A) LENGTH: 4 63 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr 
15 10 15 

Ser Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe 
20 25 30 

Ala Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser 
35 ~ 40 45 

Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys 
50 55 60 



He Gin Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala 
65 70 75 80 



Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe 
85 90 95 

Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser 
100 105 110 

Cys Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr 
115 120 125 

Thr Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala 
130 135 140 

Pro Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val 
145 150 155 160 

Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe 
165 170 175 

Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He 
180 185 190 

Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp 
195 200 205 



npet Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn 
: a 210 215 220 

l ; ile Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe 
^25 230 235 240 

llkrg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp 
SJ 245 250 255 

-ieu Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr 
% 260 265 270 

^Pro Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly 
HJ 275 280 285 

CfHy Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu 
290 295 300 

Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu 
305 310 315 320 

Leu Ser Thr Thr Glu Trp Gin He Leu Pro Cys Ser Phe Thr Thr Leu 
325 330 335 

Pro Ala Leu Ser Thr Gly Leu He His Leu His Gin Asn He Val Asp 
340 345 350 

Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val He 
355 360 365 

Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg 
370 375 380 

He Cys Ala Cys Leu Trp Met Met Leu Leu He Ala Gin Ala Glu Ala 
385 390 395 400 

Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala 




405 

His Gly Thr Leu Ser Phe Leu Val 
420 

Lys Gly Arg Leu Val Pro Gly Ala 
435 440 

Pro Leu Leu Leu Leu Leu Leu Ala 
450 455 




410 415 

Phe Phe Cys Ala Ala Trp Tyr He 
425 430 

Ala Tyr Ala Phe Tyr Gly Val Trp 
445 

Leu Pro Pro Arg Ala Tyr Ala 
460 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .2079 
(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1. .2076 



'Q. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

^JUVT TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 

■%sn Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
Si 15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 14 4 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 



TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 
Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

MGTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 
Glv Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 
Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
130 185 190 

O^CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 
'fl Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
|I| 195 200 205 

CI TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA 
< = Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 
210 215 220 

e CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 
Q Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
10 225 230 235 240 

% GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT 
l 't Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val 
245 . 250 255 

GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA 
Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val Ser Gly 
260 265 270 

GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC 
Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro 
275 280 285 

GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC 
Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser Trp His 
290 295 300 

ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC 
He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe 
305 310 315 320 

TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA 
Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro 
325 330 335 

GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys Phe Ala Gin Gly Trp 
340 345 350 

GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC 1104 
Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr 
355 360 365 

TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG 1152 
Cvs Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala Ser Gin 
370 375 380 

GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG 1200 
Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly 
385 390 395 400 

ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC 1248 

Thr Thr Aso Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp 

405 410 415 

TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG 1296 

Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp 
420 425 430 

TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG 1344 
Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly 
435 440 445 

GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC 1392 
Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys 
450 455 460 

CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC 144 0 

Pro Thr Aso Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys 
465 * 470 475 480 

GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT 1488 
Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr 
485 490 495 

AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT 1536 
Arg Leu Tro His Tyr Pro Cys Thr Val Asn Phe Thr He Phe Lys Val 
500 505 510 

AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT 1584 
Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn 
515 520 525 

TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG 1632 
Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu 
530 535 540 

CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG CAG ATA CTG CCC TGT 1680 
Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin He Leu Pro Cys 
545 550 555 560 

TCC TTC ACC ACC CTG CCG GCC CTA TCC ACC GGC CTG ATC CAC CTC CAT 1728 
Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His Leu His 
565 570 575 

CAG AAC ATC GTG GAC GTG CAA TAC CTG TAC GGT GTA GGG TCG GCG GTT 1776 
Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val 
580 585 590 



GTC 


TCC 


CTT 


GTC 


ATC 


AAA 


TGG GAG 


TAT GTC CTG TTG 


CTC 


TTC 


CTT 


CTC 


1824 


Val 


Ser 


Leu 


Val 


lie 


Lvs 


Trp Glu 


Tvr Val Leu Leu 


Leu 


Phe 


Leu 


Leu 








595 








600 


605 










CTG 


GCA 


GAC 


GCG 


CGC 


ATC 


TGC GCC 


TGC TTA TGG ATG 


ATG 


CTG 


CTG 


ATA 


1872 


L6U 


Ala 


Asp 


Ala 


Arcr 


He 


Cys Ala 


Cys Leu Trp Met 


Met 


Leu 


Leu 


He 






610 










615 


620 












GCT 


CAA 


GCT 


GAG 


GCC 


GCC 


TTA GAG 


AAC CTG GTG GTC 


CTC 


AAT 


GCG 


GCG 


1920 


Ala 


Gin 


Ala 


Glu 


Ala 


Ala 


Leu Glu 


Asn Leu Val Val 


Leu 


Asn 


Ala 


Ala 




625 










630 




635 








640 




GCC 


GTG 


GCC 


GGG 


GCG 


CAT 


GGC ACT 


CTT TCC TTC CTT 


GTG 


TTC 


TTC 


TGT 


1968 


Ala 


Val 


Ala 


Gly 

J; 


Ala 


His 


Gly Thr 


Leu Ser Phe Leu 


Val 


Phe 


Phe 


Cys 












645 






650 






655 






GCT 


GCC 


TGG 


TAC 


ATC 


AAG 


GGC AGG 


CTG GTC CCT GGT 


GCG 


GCA 


TAC 


GCC 


2016 


Ala 


Ala 


TrD 


Tvr 


He 


Lys 


Glv Arcr 


Leu Val Pro Gly 


Ala 


Ala 


Tvr 


Ala 










660 








665 




670 








TTC 


TAT 


GGC 


GTG 


TGG 


CCG 


CTG CTC 


CTG CTT CTG CTG 


GCC 


TTA 


CCA 


CCA 


2064 


Phe 


Tyr 


Gly 


Val 


Trp 


Pro 


Leu Leu 


Leu Leu Leu Leu 


Ala 


Leu 


Pro 


Pro 






675 








680 




685 










lOCGA 


GCT 


TAT 


GCC 


TAGTAA 














2082 


lArg 


Ala 


Tyr 


Ala 





















690 



; 2(2) INFORMATION FOR SEQ ID NO: 48: 

; ^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 amino acids 
□ (B) TYPE: amino acid 

"n (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

S (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
T l 5 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 



Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 
210 215 220 

Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
225 230 235 240 

Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val 
^ 245 250 255 

l -~t Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val Ser Gly 
^ 260 265 270 

■J! Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro 

'-'4 275 280 285 

Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser Trp His 
"t 290 295 300 



; " He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe 
y 305 310 315 320 



-j Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro 
325 330 335 

Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys Phe Ala Gin Gly Trp 
340 345 350 

Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr 
355 360 365 

Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala Ser Gin 
370 375 380 

Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly 
385 390 395 400 

Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp 
405 410 415 

Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp 
420 425 430 



Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly 



435 440 445 

Gly Pro Pro Cys Asn lie Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys 
450 455 460 

Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys 
465 470 475 480 

Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr 
485 490 495 

Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr lie Phe Lys Val 
500 505 510 

Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn 
515 520 525 

Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu 
530 535 540 

Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin lie Leu Pro Cys 
545 550 555 560 

: lSer Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu lie His Leu His 
5 565 570 575 

rbln Asn He Val Asp Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val 
ff 580 585 590 

\Val Ser Leu Val He Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu 
H 595 600 605 

Leu Ala Asp Ala Arg He Cys Ala Cys Leu Trp Met Met Leu Leu He 
^ 610 615 620 

^Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala 
: ^625 630 635 640 

riAla Val Ala Gly Ala His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys 
j 645 650 655 

Ala Ala Trp Tyr He Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala 
660 665 670 

Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro 
675 680 685 

Arg Ala Tyr Ala 
690 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE : NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2430 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..2427 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

ATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA CGT AAC ACC AAC 48 
Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

CGC CGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GGT CAG ATC GTT GGT 96 
Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Glv Gin lie Val Gly 
20 25 30 

GGA GTT TAC CTG TTG CCG CGC AGG GGC CCC AGG TTG GGT GTG CGC GCG 14 4 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

ACT AGG AAG ACT TCC GAG CGG TCG CAA CCT CGT GGG AGG CGA CAA CCT 192 
Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

ATC CCC AAG GCT CGC CGA CCC GAG GGT AGG GCC TGG GCT CAG CCC GGG 240 
He Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gin Pro Gly 
65 70 75 80 

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC ATG GGG TGG GCA GGA TGG 288 
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Trp 
85 90 95 

CTC CTG TCA CCC CGC GGC TCT CGG CCT AGT TGG GGC CCT ACA GAC CCC 336 
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 
100 105 110 

CGG CGT AGG TCG CGT AAT TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC 384 
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 125 

GGC TTC GCC GAC CTC GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA 432 
Gly Phe Ala Asp Leu Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu 
130 135 140 

GGG GGC GCT GCC AGG GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC 480 
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

GGC GTG AAC TAT GCA ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC 528 
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 175 

TTC CTC TTG GCT TTG CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT 57 6 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 



180 



185 



190 



GAA GTG CGC AAC GTG TCC GGG ATG TAC CAT GTC ACG AAC GAG TGC TCC 
Glu Val Arg Asn Val Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser 
195 200 205 



624 



AAC TCA AGC ATT GTG TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC 
Asn Ser Ser lie Val Tyr Glu Ala Ala Asp Met lie Met His Thr Pro 
210 215 220 



672 



GGG TGC GTG CCC TGC GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA 
Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
225 230 235 240 



720 



GCG CTC ACC CCC ACG CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG 
Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 
245 250 255 



768 



ACA ATA CGA CGC CAC GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT 
Thr He Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys 
260 265 270 



816 



TCC GCT ATG TAC GTG GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC 
^Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser 
! 275 280 285 



864 



|CAG CTG 
iGln Leu 
: 290 



TTC ACC 
Phe Thr 



ATC TCG 
He Ser 



CCT CGC CGG 
Pro Arg Arg 
295 



CAT GAG 
His Glu 



ACG GTG 
Thr Val 
300 



CAG GAC TGC 
Gin Asp Cys 



912 



^AAT TGC 
JAsn Cys 
305 



TCA ATC 
Ser He 



TAT CCC 
Tyr Pro 
310 



GGC CAC ATA 
Gly His He 



ACG GGT 
Thr Gly 
315 



CAC CGT 
His Arg 



ATG GCT TGG 
Met Ala Trp 
320 



960 



•^GAT ATG 
; Asp Met 



ATG ATG 
Met Met 



AAC TGG 
Asn Trp 
325 



TCG CCT ACA 
Ser Pro Thr 



ACG GCC 
Thr Ala 
330 



CTG GTG 
Leu Val 



GTA TCG CAG 
Val Ser Gin 
335 



1008 



^CTG CTC 
|Leu Leu 



CGG ATC 
Arg He 
340 



CCA CAA 
Pro Gin 



GCT GTC GTG 
Ala Val Val 
345 



GAC ATG 
Asp Met 



GTG GCG 
Val Ala 



GGG GCC CAT 
Gly Ala His 
350 



1056 



TGG GGA 
Trp Gly 



GTC CTG 
Val Leu 
355 



GCG GGC 
Ala Gly 



CTC GCC TAC 
Leu Ala Tyr 
360 



TAT TCC 
Tyr Ser 



ATG GTG 
Met Val 
365 



GGG AAC TGG 
Gly Asn Trp 



1104 



GCT AAG 
Ala Lys 
370 



GTT TTG 
Val Leu 



GTT GTG 
Val Val 



ATG CTA CTC 
Met Leu Leu 
375 



TTT GCC 
Phe Ala 



GGC GTC 
Gly Val 
380 



GAC GGG CAT 
Asp Gly His 



1152 



ACC CGC 
Thr Arg 
385 



GTG TCA 
Val Ser 



GGA GGG 
Gly Gly 
390 



GCA GCA GCC 
Ala Ala Ala 



TCC GAT 
Ser Asp 
395 



ACC AGG 
Thr Arg 



GGC CTT GTG 
Gly Leu Val 
400 



1200 



TCC CTC 
Ser Leu 



TTT AGC 
Phe Ser 



CCC GGG 
Pro Gly 
405 



TCG GCT CAG 
Ser Ala Gin 



AAA ATC 
Lys He 
410 



CAG CTC 
Gin Leu 



GTA AAC ACC 
Val Asn Thr 
415 



1248 



AAC GGC 
Asn Gly 



AGT TGG 
Ser Trp 
420 



CAC ATC 
His He 



AAC AGG ACT 
Asn Arg Thr 
425 



GCC CTG 
Ala Leu 



AAC TGC 
Asn Cys 



AAC GAC TCC 
Asn Asp Ser 
430 



1296 



CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC 
Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn 
435 440 445 



1344 



TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG 
Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser lie Asp Lys 
450 455 460 



1392 



TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG 
Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser 
465 470 475 480 



1440 



GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT 
Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly lie 
485 490 495 



1488 



GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC 
Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 
500 505 510 



1536 



CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC 
Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn 
515 520 525 



1584 



TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG 
Trp Gly Ala Asn Asp Ser Asp Val Leu lie Leu Asn Asn Thr Arg Pro 
530 535 540 



1632 



CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC 
Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe 
545 550 555 560 



1680 



ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC 
Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn lie Gly Gly Ala Gly Asn 
565 570 575 



1728 



AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC 
Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 
580 585 590 



1776 



ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG 
Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met 
595 600 605 



1824 



GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC 
Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe 
610 615 620 



1872 



ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC 
Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe 
625 630 635 640 



1920 



GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC 
Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp 
645 650 655 



1968 



AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG 
Arg Aso Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp 
660 665 670 



2016 



CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG CCG GCC CTA TCC ACC GGC 2064 

Gin lie Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 
675 680 685 

CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC GTG CAA TAG CTG TAC GGT 2112 

Leu lie His Leu His Gin Asn lie Val Asp Val Gin Tyr Leu Tyr Gly 

690 695 700 

GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC CTG 2160 

Val Gly Ser Ala Val Val Ser Leu Val He Lys Trp Glu Tyr Val Leu 
705 710 715 720 

TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA TGG 2208 

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu Trp 
725 730 735 

ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG GTG 2256 

Met Met Leu Leu He Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val 
740 745 750 

GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC TTC 2304 

Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe 
755 760 765 

2352 



2400 



CTT 


GTG 


TTC 


TTC 


TGT 


GCT 


GCC 


TGG 


TAC 


ATC 


AAG 


GGC AGG CTG GTC 


CCT 


Leu 


Val 


Phe 


Phe 


Cys 


Ala 


Ala 


Trp 


Tyr 


He 


Lys 


Gly Arg Leu Val 


Pro 




770 










775 










780 




GGT 


GCG 


GCA 


TAC 


GCC 


TTC 


TAT 


GGC 


GTG 


TGG 


CCG 


CTG CTC CTG CTT 


CTG 


Gly 


Ala 


Ala 


Tyr 


Ala 


Phe 


Tyr 


Gly 


Val 


Trp 


Pro 


Leu Leu Leu Leu 


Leu 


785 










790 










795 




800 


CTG 


GCC 


TTA 


CCA 


CCA 


CGA 


GCT 


TAT 


GCC 


TAGTAA 






Leu 


Ala 


Leu 


Pro 


Pro 


Arg 


Ala 


Tyr 


Ala 


















805 










810 









; (2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 



2433 



He Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gin Pro Gly 
65 70 75 80 



Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 
100 105 110 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Val Gly Tyr lie Pro Leu Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 
180 185 190 

Glu Val Arg Asn Val Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser 
195 200 205 

Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr Pro 
210 215 220 

Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
225 230 235 240 

Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 
245 250 255 

Thr He Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys 
260 265 270 

Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser 
275 280 285 

Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys 
* 290 295 300 

Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin 
325 330 335 

Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His 
340 345 350 

Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp 
355 360 365 

Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His 
370 375 380 



Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val 
385 390 395 400 



Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys 
450 455 460 

Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser 
465 470 475 480 

Asv Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He 
485 490 495 

Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 
500 505 510 

Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn 
515 520 525 

Tro Gly Ala Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro 
530 535 540 

Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe 
545 550 555 560 

Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn 
565 570 575 

Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 
580 585 590 

Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met 
595 600 ' 605 

Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe 
610 615 620 

Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe 
625 630 635 640 

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp 
645 650 655 

Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp 
660 665 670 

Gin He Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 
675 680 685 

Leu He His Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly 
690 695 700 



Val Gly Ser Ala Val Val Ser Leu Val He Lys Trp Glu Tyr Val Leu 
705 710 715 720 

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu Trp 












725 






730 


735 


Met 


Met 


Leu 


Leu 
740 


lie 


Ala 


Gin 


Ala Glu Ala Ala 
745 


Leu Glu Asn Leu Val 
750 


Val 


Leu 


Asn 
755 


Ala 


Ala 


Ala 


Val 


Ala Gly Ala His 
760 


Gly Thr Leu Ser Phe 
765 


Leu 


Val 
770 


Phe 


Phe 


Cys 


Ala 


Ala 
775 


TrD Tvr Tie Lvs 


Glv Arer Lpu VaI Pro 
780 


Gly Ala 
785 

Leu Ala 


Ala 
Leu 


Tyr 
Pro 


Ala 

Pro 
805 


Phe 
790 
Arg 


Tyr 
Ala 


Gly Val Trp Pro 
795 

Tyr Ala 


Leu Leu Leu Leu Leu 
800 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 1. . 17 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Ser Asn Ser Ser Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys 
15 10 15 

Val 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1..22 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Gly Gly He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 




1 5 * 10 15 

Ser Pro Thr Thr Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1..37 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Tyr Glu Val Arg Asn Val Ser Gly lie Tyr His Val Thr Asn Asp Cys 
15 10 15 

Ser Asn Ser Ser lie Val Tyr Glu Ala Ala Asp Met lie Met His Thr 
20 25 30 

Pro Gly Cys Gly Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 1..25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gly Gly Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr 
15 10 15 

Gin Leu Arg Arg His lie Asp Leu Leu 
20 25 

(2} INFORMATION FOR SEQ ID NO: 55: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 




(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1..25 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Gly Gly Thr Pro Thr Leu Ala Ala Arg Asp Ala Ser Val Pro Thr Thr 
15 10 15 

Thr lie Arg Arg His Val Asp Leu Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

{C5 STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Gin Val Arg Asn 
15 10 15 

Ser Thr Gly Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Gin Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Aso Cys Pro 
1 5 10 15 

Asn Ser Ser lie 
20 

(2) INFORMATION FOR SEQ ID NO: 58: 




(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Asn Asp Cys Pro Asn Ser Ser lie Val Tyr Glu Ala His Asp Ala lie 
15 10 15 

Leu His Thr Pro 
20 

{2} INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Ser Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr 
15 10 15 

Pro Gly Cys Val 
20 

(2} INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

His Asp Ala He Leu His Thr Pro Gly Val Pro Cys Val Arg Glu Gly 
1-5 10 15 

Asn Val Ser 



(2} INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS: 




(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii} MOLECULE TYPE: peptide 



{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Cys Val Arg Glu Gly Asn Val Ser Arg Cys Trp Val Ala Met Thr Pro 
15 10 15 

Thr Val Ala Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Ala Met Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr 
15 10 15 

Gin Leu Arg Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Leu Pro Ala Thr Gin Leu Arg Arg His He Asp Leu Leu Val Gly Ser 
15 10 15 

Ala Thr Leu Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 64: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Vai Gly Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu 
15 10 15 

Cys Gly Ser Val 
20 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



fy (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

SJ Gin Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr Thr Gin Gly Cys 

LCI 15 10 15 

Asn Cys Ser He 

J3(2) INFORMATION FOR SEQ ID NO: 66: 

m (i) SEQUENCE CHARACTERISTICS: 

% (A) LENGTH: 20 amino acids 

n (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Thr Gin Gly Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His 
15 10 15 

Arg Met Ala Trp 
20 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 




(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

lie Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro 
15 10 15 

Thr Ala Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Asn Trp Ser Pro Thr Ala Ala Leu Val Met Ala Gin Leu Leu Arg lie 
15 10 15 

Pro Gin Ala lie 
20 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Leu Leu Arg lie Pro Gin Ala lie Leu Asp Met lie Ala Gly Ala His 
15 10 15 

Trp Gly Val Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Ala Gly Ala His Trp Gly Val Leu Ala Gly He Ala Tyr Phe Ser Met 
15 10 15 

Val Gly Asn Met 
20 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Val Val Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr He Val Ser 
15 10 15 

Gly Gly Gin Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Ser Gly Leu Val Ser Leu Phe Thr Pro Gly Ala Lys Gin Asn He Gin 
15 10 15 

Leu He Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Gin Asn lie Gin Leu He Asn Thr Asn Gly Gin Trp His He Asn Ser 
15 10 15 

Thr Ala Leu Asn 
20 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Leu Asn Cys Asn Glu Ser Leu Asn Thr Gly Trp Tro Leu Ala Gly Leu 
15 10 15 

He Tyr Gin His Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

. (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ala Gly Leu He Tyr Gin His Lys Phe Asn Ser Ser Gly Cys Pro Glu 
15 10 15 

Arg Leu Ala Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Gly Cys Pro Glu Arg Leu Ma Ser Cys Arg Pro Leu Thr Asp Phe Asp 
15 10 15 

Gin Gly Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



■J {xi} SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

■J Thr Asp Phe Asp Gin Gly Trp Gly Pro lie Ser Tyr Ala Asn Gly Ser 

"j 1 5 10 15 

,j Gly Pro Asp Gin 

n 20 

12) INFORMATION FOR SEQ ID NO: 78: 

3 (i) SEQUENCE CHARACTERISTICS: 
□ (A) LENGTH: 20 amino acids 

^ (B) TYPE: amino acid 

= (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Ala Asn Gly Ser Gly Pro Asp Gin Arg Pro Tyr Cys Trp His Tyr Pro 
15 10 15 

Pro Lys Pro Cys 

20 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Trp His Tyr Pro Pro Lys Pro Cys Gly lie Val Pro Ala Lys Ser Val 
1-5 10 15 

Cys Gly Pro Val 
20 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Ala Lys Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val 
15 10 15 

Val Val Gly Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



"(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala Pro Thr 
15 10 15 

Tyr Ser Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Gly Ala Pro Thr Tyr Ser Trp Gly Glu Asn Asp Thr Asp Val Phe Val 
15 10 15 

Leu Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys 
O 1 5 10 "15 

S Val Cys Gly Ala 

fil 20 

*fi{2) INFORMATION FOR SEQ ID NO: 84: 

HI (i) SEQUENCE CHARACTERISTICS: 
H (A) LENGTH: 20 amino acids 

3 (B) TYPE: amino acid 

fi (C) STRANDEDNESS: single 

% (D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Val Cys He Gly Gly Ala 
15 10 15 

Gly Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 




lie Gly Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr Asp Cys Arg 
15 10 15 

Lys His Pro 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi] SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr Tyr Ser Arg Cys Gly 
15 10 15 

Ser Gly Pro Trp 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii} MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Ser Arg Cys Gly Ser Gly Pro Trp He Thr Pro Arg Cys Leu Val Asp 
15 10 15 

Tyr Pro Tyr Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) 
Cys 



SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr He 




15 10 15 

Asn Tyr Thr He 
20 

(2) INFORMATION FOR SEQ ID NO: 89: 

<i) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Pro Cys Thr He Asn Tyr Thr He Phe Lys He Arg Met Tyr Val Gly 
15 10 15 

Gly Val Glu His 
20 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Tyr Val Gly Gly Val Glu His Arg Leu Glu Ala Ala Cys Asn Trp 
15 10 15 

Thr Pro Gly Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Ala Cys Asn Trp Thr Pro Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp 
15 10 15 




Arg Ser Glu Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Thr Thr Thr 
15 10 15 

Gin Trp Gin Val 
20 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Tyr Gin Val Arg Asn Ser Thr Gly Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
ACGTCCGTAC GTTCGAATTA ATTAATCGA 



29 




(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
CCTCCGGACG TGCACTAGCT CCCGTCTGTG GTAGTGGTGG TAGTGATTAT CAATTAATTG 60 



;:.:;( 2) INFORMATION FOR SEQ ID NO: 96: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GTTTAACCAC TGCATGATG 19 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 



GTCCCATCGA GTGCGGCTAC 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
CGTGACATGG TACAT TCCGG ACACTTGGCG CACTTCATAA GCGGA 
(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
TGCCTCATAC ACAATGGAGC TCTGGGACGA GTCGTTCGTG AC 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



SEQUENCE DESCRIPTION: SEQ ID NO: 100: 



TACCCAGCAG CGGGAGCTCT GTTGCTCCCG AACGCAGGGC AC 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
{iii) HYPOTHETICAL: NO 
{iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

TGTCGTGGTG GGGACGGAGG CCTGCCTAGC TGCGAGCGTG GG 
(2) INFORMATION FOR SEQ ID NO: 102: 

1 (i) SEQUENCE CHARACTERISTICS: 
\ (A) LENGTH: 48 base pairs 

I (B) TYPE: nucleic acid 

: (C) STRANDEDNESS: single 

f (D) TOPOLOGY: linear 

I (ii) MOLECULE TYPE: DNA (genomic) 

j (iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
1CGTTATGTGG CCCGGGTAGA TTGAGCACTG GCAGTCCTGC ACCGTCTC 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
CAGGGCCGTT CTAGGCCTCC ACTGCATCAT CATATCCCAA GC 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
CCGGAATGTA CCATGTCACG AACGAC 26 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

£B) TYPE: nucleic acid 
D (C) STRANDEDNESS: single 

\!i (D) TOPOLOGY: linear 

f|] (ii) MOLECULE TYPE: DNA (genomic) 

If! (iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE : NO 

2 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

IpCTCCATTGT GTATGAGGCA GCGG 24 
fi{2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GAGCTCCCGC TGCTGGGTAG CGC 23 
(2) INFORMATION FOR SEQ ID NO: 107: 




(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
CCTCCGTCCC CACCACGACA ATACG 25 
(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
CTACCCGGGC CACATAACGG GTCACCG 27 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

GGAGGCCTAC AACGGCCCTG GTGG 24 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 



HCVENV.-T 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
TTCTATCGAT TAAATAGAAT TC 22 
(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
0 (iii) HYPOTHETICAL: NO 
y (iii) ANTI-SENSE: NO 

0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

GCCATACGCT CACAGCCGAT CCC 23 



HCVENV.-T 
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PURIFIED HEPATITIS C VIRUS ENVELOPE PROTEINS FOR DIAGNOSTIC AND 
THERAPEUTIC USE 



The present invention relates to the general fields of recombinant protein 
expression, purification of recombinant proteins, s\~thetic peptides, diagnosis of HCV 
infection, prophylactic treatment against HCV infection and to the prognosis/monitoring 

10 of the clinical efficiency of treatment of an individual with chronic hepatitis, or the 
prognosis/monitoring of natural disease. 

More particularly, the present invention relates tc purification methods for 
hepatitis C virus envelope proteins, the use in diagnosis, prophylaxis or therapy of HCV 
envelope proteins purified according to the methods described in the present invention, 

15 the use of single or specific oiigcmeric El and/or E2 and/or E1/E2 envelope proteins in 
assays for monitoring disease, and/or diagnosis of disease, and/or treatment of disease. 
The invention also relates to epitopes of the El and/or E2 envelope proteins and 
monoclonal antibodies thereto, as well their use in diagnosis, prophylaxis or treatment. 

20 Background of the invention 

The E2 protein purified from cell lysates according to the methods described in 
the present invention reacts with approximately 95% of patient sera. This reactivity is 
similar to the reactivity obtained with E2 secreted from CHO cells {Spaete et a!., 1 992). 

25 However, the intraceilufarly expressed form of E2 may more closely resemble the native 
virai envelope protein because it contains high mannose carbohydrate motifs, whereas 
the E2 protein secreted from CHO cells Is further modified with galactose and sialic acid 
sugar moieties. When the aminoterminal half of E2 is expressed in the baculovirus 
system, only about 13 to 21% of sera from several patient groups can be detected 

30 (lnoue et aL 1 992). After expression of E2 from E, coii, the reactivity of HCV sera was 
even lower and ranged from 14 {Yokosuka et aL, 1992) to 17% (Mita et aL, 1992K 
About 75% of HCV sera (and 95% of chronic patients) are anti-El positive using 
the purified, vaccinia-expressed recombinant El protein of the present invention, in 
sharp contrast with the results of Kohara et al. (1992) and Hsu et al. (1993). Kohara 
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et al. used a vaccinia-virus expressed £1 protein and detected anti-E1 antibodies in 7 
to 23% of patients, while Hsu%t ai. only detected 14/50 (28%) sera using bacuiovirus- 
expressed El . 

These results show that not only a good expression system but also a good 
5 purification protocol are required to reach a high reactivity of the envelope proteins with 
human patient sera. This can be obtained using the proper expression system and/or 
purification protocols of the present invention which guarantee the conservation of the 
natural folding of the protein and the purification protocols of the present invention 
which guarantee the elimination of contaminating proteins and which preserve the 

10 conformation, and thus the reactivity of the HCV envelope proteins. The amounts of 
purified HCV envelope protein needed for diagnostic screening assays are in the range 
of grams per year. For vaccine purposes, even higher amounts of envelope protein 
would be needed. Therefore, the vaccinia virus system may be used for selecting the 
best expression constructs and for limited upscaiinc, and large-scale expression and 

15 purification of single specific oiigomeric envelope proteins containing high-manncse 
carbohydrates may be achieved when expressed from several yeast strains. In the case 
of hepatitis B for example, manufacturing of HBsAg from mammalian ceils was much 
more mostly compared with yeast-derived hepatitis 3 vaccines. 

20 Aims of the invention 

It is an aim of the present invention to provide a new purification method for 
recombinantty expressed El and/or E2 and/or El /E2 proteins such that said recombinant 
proteins are directly usabie for diagnostic and vaccine purposes as single or specific 
25 oiigomeric recombinant proteins free from contaminants instead of aggregates. 

It is another aim of the present invention to provide compositions comprising 
purified (single or specific oiigomeric) recombinant El and/or E2 and/or E1/S2 
glycoproteins comprising conformational epitopes from the El and/or E2 domains of 
HCV. 

30 it is yet another aim of the present invention to provide novel recombinant vector 

constructs for recombinantiy expressing E1 and/or £2 and/or E1/E2 proteins, as well as 
host cells transformed with said vector constructs. 

It is also an aim of the present invention to provide a method for producing and 
purifying recombinant HCV El and/or E2 and/or E1/E2 proteins. 
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It is also an aim of the present invention to provide diagnostic and immunogenic 
uses of the recombinant HCV. El and/or E2 and/or E1/E2 proteins of the present 
invention, as well as to provide kits for diagnostic use, vaccines or therapeutics 
comprising any of the recombinant HCV 51 and/or E2 and/or E1/E2 proteins of the 
5 present invention. 

it is further an aim of the present invention to provide for a new use of El , E2, 
and/or E1 /E2 proteins, or suitable parts thereof, for monitoring/prognosing :he response 
to treatment of patients (e.g. with interferon) suffering from HCV infection. 

It is also an aim of the present invention to provide for the use of the 
10 recombinant El, £2, and/or E1/E2 proteins of the present invention in HCV screening 
and confirmatory antibody tests. 

it is also an aim of the present invention :o provide El and/or E2 peptides which 
can be used for diagnosis of HCV infection and for raising antibodies. Such peptides 
may also be used to isolate human monoclonal antibodiss. 
15 It is also an aim of the present invention to provide monoclonal antibodies, more 

particularly human monoclonal antibodies or mouse monoclonal antibodies which are 
humanized, which react specifically with El and/or E2 epitopes, either comprised in 
peptides or conformational epitopes comprised in recombinant proteins. 

It is also an aim of the present invention to provide possible uses of anti-El or 
20 anti-E2 monoclonal antibodies for HCV antigen detection or for therapy of chronic HCV 
infection. 

tt is also an aim of the present invention to provide kits for monitoring/prognosing 
the response to treatment (e.g. with interferon) of patients suffering from HCV infection 
or monitoring/prognosing the outcome of the disease. 
25 All the aims of the present invention are considered to have been met by the 

embodiments as set out below. 

Definitions 

30 The following definitions serve to illustrate the different terms and expressions 

used in the present invention. 

The term 'hepatitis C virus single envelope protein' refers to a polypeptide or an 
analogue thereof (e.g. mimotopes) comprising an amino acid sequence (and/or amino 
acid analogues) defining at least one HCV epitope of either the E1 or the E2 region. 
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These single envelope proteins in the broad sense of the word may be both monomeric 
or homo-oligomeric forms of recombinantiy expressed enveiooe proteins. Typically, the 
sequences defining the epitope correspond to the amino acid sequence of either the El 
or the £2 region of HCV (either identically or via substitution of analogues of the native 
5 amino acid residue that do not destroy the epitope). In general, the epttope-defining 
sequence will be 3 or more amino acids in length, more typically, 5 or more amino acids 
in length, more typically 8 or more amino acids in length, and even more typically 10 
or more amino acids in length. With respect to conformational epitopes, the length of 
the epitope-defining sequence can be subject to wide variations, since it is believed that 

10 these epitopes are formed by the three-dimensional shape of the antigen (e.g. folding). 
Thus, the amino acids defining the epitope can be relatively few in number, but widely 
dispersed along the length of the molecule being brought into the correct eprtope 
conformation via folding. The portions of the antigen between the residues defining the 
epitope may not be critical to the conformational structure of the epitope. For example, 

15 deletion or substitution of these intervening sequences may not affect the 
conformational epitope provided sequences critical to epitope conformation are 
maintained (e.g. cysteines involved in disulfide bonding, giycosylation sites, etc.). A 
conformational epitope may aiso be formed by 2 or mere essential regions of subunits 
of a homooligomer or heterooiigomer. 

20 The HCV antigens of the present invention comprise conformational epitopes 

from the El and/or E2 (envelope; domains of HCV. The E1 domain, which is believed 
to correspond to the virai envelope protein, is currently estimated to span amino acids 
192*383 of the HCV polyprotein (Hijikata et a!., 1991). Upon expression in a 
mammalian system (glycosylated), it is believed to have an approximate molecular 

25 weight of 35 kDa as determined via SDS-PAGE. The E2 protein, previously called NS1 . 
is beiieved to span amino acids 384-809 or 384-746 (Grakoui et al., 1 993) of the HCV 
polyprotein and to also be an envelope protein. Upon expression in a vaccinia system 
(glycosylated), it is believed to have an apparent gel molecular weight of about 72 kDa. 
It is understood that these protein endpoims are approximations (e.g. the carboxy 

30 terminal end of E2 could lie somewhere in the 730-820 amino acid region, e.g. ending 
at amino acid 730, 735, 740, 742, 744, 745, preferably 745, 747, 748, 750, 760, 
770, 780, 790, 800, 809, 810, 820). The E2 protein may aiso be expressed together 
with the El , P7 (aa 747-809), NS2 laa 81 0-1 025), NS4A (aa 1 558-171 1 > or NS4S (aa 
1712-1 972). Expression together with these other HCV proteins may be important for 
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obtaining the correct protein folding. 

It is also understood that the isolates used m the examples section of the present 
invention were not intended to limit the scope of the invention and that any HCV isolate 
from type 1, 2. 3, 4, 5, 6. 7. 8, 9, 10 or any other new genotype of HCV is a suitable 
source of El and/or E2 sequence for the practice of the present invention. 

The E1 and E2 antigens used in the present invention may be full-length viral 
proteins, substantially full-length versions thereof, or functional fragments thereof (e.g. 
fragments which are not missing sequence essential to the formation or retention of an 
epitope). Furthermore, the HCV antigens of the present invention can also include other 
sequences that do not block or prevent the formation of the conformational epitope of 
interest. The presence or absence of a conformational epitope can be readily determined 
though screening the antigen of interest with an antibody (polyclonal serum or 
monoclonal to the conformational epitope) and comparing its reactivity to that of a 
denatured version of the antigen which retains only linear epitopes (if any). In such 
screening using polyclonal antibodies, it may be advantageous tc adsorb the polyclonal 
serum first with the denatured antigen and see if it retains antibodies to the antigen of 
interest. 

The HCV antigens of the present invention can be made by any recombinant 
method that provides the epitope of intrest. For example, recombinant intracellular 
expression in mammalian or insect cells is a preferred method to provide glycosylated 
E1 and/or E2 antigens in 'native' conformation as is the case for the natural HCV 
antigens. Yeast cells and mutant yeast strains (e.g. mnn 9 mutant (Kniskern et ai., 
1 994) or glycosylation mutants derived by means of vanadate resistence selection 
(Bailou et ai., 1991)) may be ideally suited for production of secreted high-mannose-type 
sugars; whereas proteins secreted from mammalian cells may contain modifications 
including galactose or sialic acids which may be undesirable for certain diagnostic or 
vaccine applications. However, it may also be possible and sufficient for certain 
applications, as it is known for proteins, to express the antigen in other recombinant 
hosts (such as E. coli) and renature the protein after recovery. 

The term 'fusion polypeptide' intends a polypeptide in which the HCV antigen(s) 
are part of a single continuous chain of amino acids, which chain does not occur in 
nature. The HCV antigens may be connected directly to each other by peptide bonds or 
be separated by intervening amino acid sequences. The fusion polypeptides may also 
contain amino acid sequences exogenous to HCV. 
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The term 'solid phase' intends a solid body to which the individual HCV antigens 
or the fusion polypeptide comprised of HCV antigens are bound covalently or by 
noncovaient means such as hydrophobic adsorption. 

The term 'biological sample' intends a fluid or tissue of a mammalian individual 
5 (e.g. an anthropoid, a human) that commonly contains antibodies produced by the 
individual, more particularly antibodies against HCV. The fluid or tissue may also contain 
HCV antigen. Such components are known in the art and include, without limitation, 
blood, plasma, serum, urine, spinal fluid, lymph fluid, secretions of the respiratory, 
intestinal or genitourinary tracts, tears, saliva, milk, white biood cells and myelomas. 

10 Body components include biological liquids. The term 'biological liquid' refers to a fluid 
obtained from an organism. Some biological fluids are used as a source of other 
products, such as clotting factors (e.g. Factor V11!;C), serum albumin, growth hormone 
and the like. In such cases, it is important that the source of biological fluid be free of 
contamination by virus such as HCV. 

1 5 The term 'immunologically reactive' means that the antigen in question will react 

specifically with anti-HCV antibodies present in a body component from an HCV 
infected individual. 

The term 'immune complex' intends the combination formed when an antibody 
binds to an epitope on an antigen. 

20 'E1' as used herein refers to a protein or polypeptide expressed within the first 

400 amino acids of an HCV polyprotein, sometimes referred to as the E, ENV or S 
protein. In its natural form it is a 35 kDa glycoprotein which is found in strong 
association with membranes. In most natural HCV strains, the E1 protein is encoded in 
the viral polyprotein following the C (core) protein. The E1 protein extends from 

25 approximately amino acid (aa) 192 to about aa 383 of the full-length polyprotein. 

The term 'El' as used herein also includes analogs and truncated forms that are 
immunologically cross-reactive with natural 51, and includes El proteins of genotypes 
1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or any other newly identified HCV type or subtype. 

'E2' as used herein refers to a protein or polypeptide expressed within the first 

30 900 amino acids of an HCV polyprotein, sometimes referred to as the NS1 protein. In 
its natural form it is a 72 kDa glycoprotein that is found in strong association with 
membranes. In most natural HCV strains, the E2 protein is encoded in the viral 
polyprotein following the El protein. The E2 protein extends from approximately amino 
acid position 384 to amino acid position 746, another form of E2 extends to amino acid 
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position 309. The term '£2' as used herein also includes analogs and truncated forms 
that are immunologically cross-reactive with natural E2. For example, insertions of 
multiple codons between codon 383 and 384. as well as deletions of amino acids 334- 
387 have been reported by Kato et al. (1 952). 

'E1/E2' as used herein refers to an oiigomeric form of envelope proteins 
containing at least one El component and at least one E2 component. 

The term 'specific oiigomeric' E1 and/or E2 and/or E1/E2 envelope proteins refers 
to all possible oiigomeric forms of recombinantly expressed E1 and/or E2 enveiope 
proteins which are not aggregates. E1 and/or E2 specific oiigomeric envelope proteins 
are also referred to as homo-oiigomeric El or E2 enveiope proteins (see beiowl. 

The term 'single or specific oiigomeric' El and/or E2 and/or E1/E2 envelope 
proteins refers to single monomeric El or =2 proteins (single in the strict sense of the 
word) as well as specific oiigomeric El and/cr 52 and/or El /E2 recombinantly expressed 
proteins. These single or specific oligom-r-'c enveiope proteins according to the present 
invention can be further defined by the following formula (EDJE2), wherein x can be 
a number between 0 and 100, and y can be a number between o and 100. provided 
that x and y are not both 0. With x = 1 and y = 0 said envelope proteins include 
monomeric E1 . 

The term 'homo-oiigomer' as used herein refers to a complex of E1 and/or =2 
containing more than one El or E2 monomer, e.g. E1/E1 dimers, E1/E1/E1 trimers or 
El /El /El /El tetramers and E2/E2 dimers, E2/E2/E2 trimers or E2/E2/E2/E2 tetramers. 
El pentamers and hexamers, E2 pentamers and hexamers or any higher-order homo- . 
oligomers of El or E2 are ait 'homo-oligomers' within the scope of this definition. The 
oligomers may contain one, two, or several different monomers of El or E2 obtained 
from different types or subtypes of hepatitis C virus including for example those 
described in an international application published under WO 94-/25601 and European 
application No. 948701 66.9 both by the present applicants. Such mixed oligomers are 
still homo-oligomers within the scope of this invention, and may allow more universal 
diagnosis, prophylaxis or treatment of HCV. 

The term 'purified' as applied to proteins herein refers to a composition wherein 
the desired protein comprises at least 35% of the total protein component in the 
composition. The desired protein preferably comprises at least 40%. more preferably at 
least about 50%. more preferably at least about 60%. still more preferably at least 
about 70%, even more preferably at least about 80%, even more preferably at least 
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about 90%, and most preferably at least about 95% of the totai protein component. 
The composition may contairrother compounds such as carbohydrates, salts, lipids, 
solvents, and the like, withouth affecting the determination of the percentage purity as 
used herein. An 'isolated' HCV protein intends an HCV protein composition that is at 
5 least 35% pure. 

The term 'essentially purified proteins' refers to proteins purified such that they 
can be used for in vitro diagnostic methods and as a therapeutic compound. These 
proteins are substantially free from cellular proteins, vector-derived proteins or other 
HCV viral components. Usually these proteins are purified to homogeneity (at least 80% 
10 pure, preferably, 90%, more preferably 95%, more preferably 97%, more preferably 
98%, more preferably 99%, even more preferably 99.5%, and most preferably the 
contaminating proteins should be undetectable by conventional methods like 3DS-PAGE 
and stiver staining. 

The term 'recombinantiy expressed' used within the context of the present 
15 invention refers to the fact that the proteins of the present invention are produced by 

recombinant expression methods be it in prckaryotes, or lower or higher eukaryotes as 

discussed in detail below. 

The term 'lower eukarycte' refers to host cells such as yeast, fungi and the like. 

Lower eukaryotes are generally (but not necessarily) unicellular. Preferred lower 
20 eukaryotes are yeasts, particularly species within Saccharom vces , 

Schtzosaccharomvces , Kluveromvces, Fichia (e.g. Pichia pastoris ), Hansenula (e.g. 

Hansenula polymorphs) , Yarowia, Schwaniomvces, Schizosac charomvces, 

ZvQosaccharomvces and the like. Saccharomvces cerevisiae , S. carlsberaensis and K. 

lactis are the most commonly used yeast hosts, and are convenient fungal hosts. 
25 The term 'prokaryotes' refers to hosts such as E.coli , Lactobacillus , 

Lactococcus , Salmonella , Streptococcus , Bacillus subtiiis or Streotomvces . Also these 

hosts are contemplated within the present invention. 

The term 'higher eukaryote' refers to host cells derived from higher animals, such 

as mammals, reptiles, insects, and the like. Presently preferred higher eukaryote host 
30 cells are derived from Chinese hamster (e.g. CHOK monkey {e.g. COS and Vero cells), 

baby hamster kidney (BHKL pig kidney (PK1 5), rabbit kidney 1 3 cells (RK 1 3), the human 

osteosarcoma cell line 143 B, the human cell line HeLa and human hepatoma ceil lines 

like Hep G2, and insect cell lines (e.g. Soodootera fruoiperda ). The host cells may be 

provided in suspension or flask cultures, tissue cultures, organ cultures and the like. 
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Alternatively the host cells may aiso be transgenic animals. 

The term 'polypeptide' --refers to a polymer of amino acids and does not refer to 
a specific length of the product; thus, peptides, oligopeptides, and proteins are included 
within the definition of polypeptide. This term also does not refer to or exclude post- 
expression modifications of the polypeptide, for example, glycosylates, acetylations. 
phosphorylations and the like. Included within the definition are. for example, 
polypeptides containing one or more analogues of an amino acid [including, for example, 
unnatural amino acids, PIMA, etc.). polypeptides with substituted linkages, as well as 
other modifications known in the art. both naturally occurring and non-naturally 
occurring. 

The term 'recombinant polynucleotide or nucleic acid' intends a polynucleotide 
or nucleic acid of genomic, cDNA, semisynthetic, or synthetic origin which, by virtue 
of its origin or manipulation : (1] is net associated with all or a portion of a 
polynucleotide with which it is associated in nature. (2) is linked tc a polynucleotide 
other than that to which it is linked ,n nature, or (3) does not occur in nature. 

The term 'recombinant host cells', 'host cells', 'cells', 'cell lines', 'cell cultures', 
and other such terms denoting microorganisms or higher eukaryotic cell lines cultured 
as unicellular entities refer to cells which can be or have been, used as recipients for a 
recombinant vector or other transfer polynucleotide, and include the progeny of the 
original cell which has been transfected. It is understood that the progeny of a single 
parental cell may not necessarily be completely identical in morphology or in genomic 
or total DNA complement as the original parent, due to natural, accidental, or deliberate 
mutation. 

The term 'replicon' is any genetic element, e.g., a pla?mid. a chromosome, a 
virus, a cosmid, etc., that behaves as an autonomous unit of polynucleotide replication 
within a cell; i.e., capable of replication under its own control. 

The term 'vector' is a replicon further comprising sequences providing replication 
and/or expression of a desired open reading frame. 

The term 'control sequence' refers to polynucleotide sequences which are 
necessary to effect the expression of coding sequences to which they are ligated. The 
nature of such control sequences differs depending upon the host organism; in 
prokaryotes, such control sequences generally include promoter, ribosoma! binding site, 
and terminators; in eukaryotes, generally, such control sequences include promoters, 
terminators and, in some instances, enhancers. The term 'control sequences' is intended 
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to include, at a minimum, ail components whose presence is necessary for expression, 
and may aiso include additional components whose presence is advantageous, for 
example, leader sequences which govern secretion. 

The term 'promoter' is a nucleotide sequence which is comprised of consensus 
sequences which allow the binding of RNA polymerase to the DNA template in a manner 
such that mRNA production initiates at the normal transcription initiation site for the 
adjacent structural gene. 

The expression 'operabiy linked' refers to a juxtaposition wherein the 
components so described are in a relationship permitting them to function in their 
intended manner. A control sequence 'operabiy linked' to a coding sequence is ligated 
in such a way that expression of the coding sequence is achieved under conditions 
compatible with the control sequences. 

An 'open reading frame' (CRFj is a region of a polynucleotide sequence which 
encodes a polypeptide and does not contain stop codons; this region may represent a 
portion of a coding sequence or a total ceding sequence. 

A 'coding sequence' is a polynucleotide sequence which is transcribed into 
mRNA and/or translated into a polypeptide when placed under the control of appropriate 
regulatory sequences. The boundaries of the coding sequence are determined by a 
translation start codon at the 5'-terminus and a translation stop codon at the 3'- 
terminus. A coding sequence can include but is not limited to mRNA. DNA (including 
cDNA), and recombinant polynucleotide sequences. 

As used herein, 'epitope' or 'antigenic determinant' means an amino acid 
sequence that is immunoreactive. Generally an epitope consists of at least 3 to 4 amino 
acids, and more usually, consists of at least 5 or 6 amino acids, sometimes the epitope 
consists of about 7 to 8, or even about 10 amino acids. As used herein, an epitope of 
a designated polypeptide denotes epitopes with the same amino acid sequence as the 
epitope in the designated polypeptide, and immunologic equivalents thereof. Such 
equivalents also include strain, subtype ( = genotype), or type(group)-specific variants, 
e.g. of the currently known sequences or strains belonging to genotypes 1 a, 1 b, 1 c, 1 d. 
1e, 1f, 2a, 2b. 2c, 2d, 2e, 2f, 2g, 2h. 2i, 3a. 3b, 3c. 3d. 3e. 3f, 3g, 4a. 4b. 4c. 4d. 4e. 
4f, 4g, 4h, 4i, 4j, 4k, 41. 5a, 5b. 6a, 6b, 6c, 7a, 7b, 7c, 8a, 8b, 9a, 9b, 10a. or any 
other newly defined HCV (sub)type. it is to be understood that the amino acids 
constituting the epitope need not be part of a linear sequence, but may be interspersed 
by any number of amino acids, thus forming a conformational epitope. 
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The term 'immunogenic' refers to the ability of a substance to cause a humoral 
and/or cellular response, whether alone or when linked to a earner, in the presence or 
absence of an adjuvant. 'Neutralization' refers to an immune response that blocks the 
infectivity, either partially or fully, of an infectious agent. A 'vaccine* is an immunogenic 
composition capable of eliciting protection against HCV. whether partial or complete. 
A vaccine may also be useful for treatment of an individual, in which case it is called 
a therapeutic vaccine. 

The term 'therapeutic' refers to a composition capable of treating HCV infection. 
The term 'effective amount' refers to an amount of epitope-bearing polypeptide 
sufficient to induce an immunogenic response in the individual to which it is 
administered, or to otherwise detectsbly immunoreact in its intended system (e.g., 
immunoassay). Preferably, the effective amount is sufficient to effect treatment, as 
defined above. The exact amount necessary will vary according to the application. For 
vaccine applications or for the generation of polyclonal antiserum / antibodies, for 
example, the effective amount may very defending on the species, age. and general 
condition of the individual, the severity of the condition being treated, the particular 
polypeptide selected and its mode of administration, etc. It is also believed that effective 
amounts will be found within a relatively large, non-critical range. An -appropriate 
effective amount can be readily determined using only routine experimentation. 
Preferred ranges of El and/or E2 and/or E1/E2 single or specific oligemeric envelope 
proteins for prophylaxis of HCV disease are 0.01 to 100^g/dose, preferably 0.1 to 50 
//g/dose. Several doses may be needed per individual in order to achieve a sufficient 
immune response and subsequent protection against HCV disease. 



Detailed description of the invention 

More particularly, the present invention contemplates a method for isolating or 
purifying recombinant HCV single or specific oligomeric envelope protein selected from 
the group consisting of El and/or E2 and/or E1/E2, characterized in that upon lysing the 
transformed host ceils to isolate the recombinantiy expressed protein a disulphide bond 
cleavage or reduction step is carried out with a disculphide bond cleaving agent. 

The essence of these 'single or specific oligomeric' envelope proteins of the 
invention is that they are free from contaminating proteins and that they are not 
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disufphide bond finked with contaminants. 

The proteins according-'to the present invention are recombinantly expressed in 



lower or higher eukaryotic cells or in prokaryotes. The recombinant proteins of the 
present invention are preferably glycosylated and may contain high-mannose-type, 
hybrid, or complex glycosylates. Preferentially said proteins are expressed from 
mammalian cell lines as discussed in detaii in the Examples section, or in yeast such as 
in mutant yeast strains also as detailed in the Examples section. 

The proteins according to the present invention may be secreted or expressed 
within components of the ceil, such as the ER or the Goigi Apparatus. Preferably, 
however, the proteins of the present invention bear high-mannose-type glvcosyfations 
and are retained in the E3 or Gcigi Apoaratus of mammalian cells or are retained in or 
secreted from yeast ceils, preferably secreted from yeast mutant strains such as the 
mnn9 mutant (Kniskern et ai., 1 39^;, or from mutants that have been selected by 
means of vanadate resistence (Eallcu et a!., 1991), 

Upon expression of HCV envelope proteins, the present inventors could show 
that some of the free thiol groups of cysteines not involved in intra- or inter-molecular 
disulphide bridges, react with cysteines of host or expression-system-derived (e.g. 
vaccinia) proteins or of other HCV envelope proteins (single or oligomeric), and form 
aspecific intermolecuiar bridges. This results in the formation of 'aggregates' of HCV 
envelope proteins together with contaminating proteins. It was also shown in WO 
92/08734 that 'aggregates' were obtained after purification, but it was not described 
which protein interactions were involved. In patent application WO ,92/08734, 
recombinant E1/E2 protein expressed with the vaccinia virus system were partially 
purified as aggregates and only found to be 70% pure, rendering the purified aggregates 
not useful for diagnostic, prophylactic or therapeutic purposes. 

Therefore, a major aim of the present invention resides in the separation of single 
or specific-oligomeric HCV envelope proteins from contaminating proteins, and to use 
the purified proteins (> 95% pure) for diagnostic, prophylactic and therapeutic 
purposes. To those purposes, the present inventors have been able to provide evidence 
that aggregated protein complexes ('aggregates') are formed on the basis of disuiphide 
bridges and non-covalent protein-protein interactions. The present invention thus 
provides a means for selectively cleaving the disulphide bonds under specific conditions 
and for separating the cleaved proteins from contaminating proteins which greatly 
interfere with diagnostic, prophylactic and therapeutic applications. The free thiol groups 
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may be blocked (reversibly or irreversibly) in order to prevent the reformation of 
disulphide bridges, or may be left to oxidize and oiigomerize with other envelope 
proteins (see definition homo-oligomer). It is to be understood that such protein 
oligomers are essentially different from the 'aggregates' described in WO 92/08734 and 
WO 94/01773. since the level of contaminating proteins is undetectable. 
Said disuphide bond cleavage may aiso be achieved by: 

(1) performic acid oxidation by means of cysteic acid in which case the cysteine 
residues are modified into cysteic acid (Moore et al., 1953). 

(2) Suifitolysis (R-S-S-R - 2 R-S0' 3 ) for example by means of sulphite (SO : \) together 
with a proper oxidant such as Cu 2 ' in which case the cysteine is modified into S-sulpho- 
cysteine (Bailey and Cole. 1959). 

(3) Reduction by means of mercaptans, such as dithictreito! (DDT), S-mercapto-ethanoi. 
cysteine, glutathione Red, e-mercapto-atnylamine. or thicglycollic acid, of which DTT 
and B-mercapto-ethanol are commonly used (Cleiand. 1 964), is the preferred method 
of this invention because the method can be performed in a water environment and 
because the cysteine remains unmodified. 

(4) Reduction by means of a phosphine (e.g. Bu 3 P) (Ruegg and Rudinger, 1977). 

All these compounds are thus to be regarded as agents or means for cleaving 
disulphide bonds according to the present invention. 

Said disulphide bond cleavage (or reducing) step of the present invention is 
preferably a partial disulphide bond cleavage (reducing) step (carried out under partial 
cleavage or reducing conditions). 

A preferred disulphide bond cleavage or reducing agent according to the present 
invention is dithiothreitoi (DTT). Partial reduction is obtained by using a low 
concentration of said reducing agent, i.e. for DTT for example in the concentration range 
of about 0.1 to about 50 mM, preferably about 0.1 to about 20 mM, preferably about 
0.5 to about 1 0 mM, preferably more than 1 mM, more than 2 mM or more than 5 mM. 
more preferably about 1.5 mM, about 2.0 mM, about 2.5 mM, about 5 mM or about 
7.5 mM. 

Said disulphide bond cleavage step may also be carried out in the presence of a 
suitable detergent (as an example of a means for cleaving disulphide bonds or in 
combination with a cleaving agent) able to dissociate the expressed proteins, such as 
DecylPEG, EMPIGEN-BB, NP-40. sodium cholate, Triton X-100. 

Said reduction or cleavage step (preferably a partial reduction or cleavage step) 
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is carried out preferably in m rhe presence of (with) a detergent. A preferred detergent 
according to the present invention is Empigen-SE. The amount of detergent used is 
preferably in the range of 1 to 10 %. preferably more than 3°c, more preferably about 
3.5% of a detergent such as 5mpigen-3E. 

A particularly preferred method for obtaining disuiphide bond cleavage employs 
a combination of a classical disuiphide bond cleavage agent as detailed above and a 
detergent (also as detailed above). As contemplated in the Examples section, the 
particular combination of a low concentration of DTT (1 .5 to 7.5 mM) and about 3.5 % 
of Empigen-B8 is proven to be a particularly preferred combination of reducing agent and 
detergent for the purification of recombinantly expressed E1 and E2 proteins. Upon 
gelfiitration chromatography, said partial -eduction is shown to result in the production 
of possibly dimeric E1 protein and separation of this E1 protein from contaminating 
proteins that cause false reactivity upon use in immunoassays. 

It is, however, to be understood that alsc any other combination of any reducing 
agent known in the art with any detergent or other means known in the art to make the 
cysteines better accessible is also within the scope of the present invention, insofar as 
said combination reaches the same goaf of disuiphide bridge cleavage as the preferred 
combination exampiified in the present invention. 

Apart from reducing the disuiphide bonds, a disuiphide bond cieaving means 
according to the present invention may also include any disuiphide bridge exchanging 
agents (competitive agent being either organic or proteinaeous, see for instance 
Creighton, 1 988) known in the art which allows the following type of reaction to occur: 
Rl S - S R2 -r R3 SH -* Rl S - S R3 + R2 SH 

* R1, R2: compounds of protein aggregates 

* R3 SH: competitive agent (organic, proteinaeous) 

The term 'disuiphide bridge exchanging agent' is to be interpretated as including 
disuiphide bond reforming as well as disuiphide bond blocking agents. 

The present invention also relates to methods for purifying or isolating HCV 
single or specific oligomeric enveleiope proteins as set out above further including the 
use of any SH group blocking or binding reagent known in the art such as chosen from 
the following list: 

Glutathion 

5.5'-dithiobis-{2-nitrobenzoic acid) or bis-(3-carboxy-4-nitropheny!)-disulphice 
(DTNB or Ellman's reagent) (Eimann. 1959) 
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N-ethylmaleimide {MEM; Benesch 5: al., 1955) 

N-(4-dimethylamino-3,5^initrophenyi) maieimide or Tuppy's maieimide which 
provides a coior to the protein 
P-chloromercuribenzoate {Grassetti et al., 1969) 

4-vinylpyridine {Friedman and Kruil, 1 969) can be liberated after reaction by acid 
hydrolysis 

acrylonitrile, can be liberated after reaction by acid hydrolysis (Weil and Seibies, 
1961} 

NEM-biotin {e.g. obtained from Sigma 31267) 
2,2'-dithiopyridine (Grassetti and Murray. 1967] 
4,4'-dithiopyridine {Grassetti and Murray, 1967) 
6,5 f -dithiodinicontinic acid (DTDNA; Brown and Cunnigham, 1970) 
2,2'-dithiobis-(5'-nitropyridine) (D7NP; US paten: 3597160) or other ditniobis 
{heterocyclic derivative! compounds (Grassetti and Murray, 1969) 
A survey of the publications cited shows that often different reagents for 
sulphydryi groups will react with varying numbers of thiol groups of the same protein 
or enzyme molecule. One may conclude that this variation in reactivity of the thiol 
groups is due to the steric environment of these groups, such as the shape of the 
molecule and the surrounding groups of atoms and their charges, as well as to the size, 
shape and charge of the reagent molecule or ion. Frequently the presence of adequate 
concentrations of denaturants such as sodium dodecylsuifate, urea or guanidine 
hydrochloride will cause sufficient unfolding of the protein molecule to permit equal 
access to all of the reagents for thiol groups. By varying the concentration of 
denaturant, the degree of unfolding can be controlled and in this way thiol groups with 
different degrees of reactivity may be revealed. Although up to date most of the work 
reported has been done with p-chiorornercuribenzoate, N-ethyimaieimide and DINE, it 
is likely that the other more recently developed reagents may prove equally useful. 
Because of their varying structures, it seems likely, in fact, that they may respond 
differently to changes in the steric environment of the thiol groups. 

Alternatively, conditions such as low pH {preferably lower than pH 6) for 
preventing free SH groups from oxidizing and thus preventing the formation of large 
intermolecular aggregates upon recombinant expression and purification of E1 and E2 
(envelope) proteins are also within the scope of the present invention. 

A preferred SH group blocking reagent according to the present invention is N- 
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etnylmaleimide (NEM). Said SH croup blocking reagent may be administrated during lysis 
of the recombinant host cells ind after the above-mentioned partial reduction process 
or after any other process for cleaving disulphide bridges. Said SH group blocking 
reagent may also be modified with any group capable of providing a detectable label 
5 and/or any group aiding in the immobilization of said recombinant protein to a solid 
substrate, e.g. biotinylated NEM. 

Methods for cleaving cysteine bridges and blocking free cysteines have also been 
described in Darbre (19871, Means and Feeney (1971), and by Wong (1993). 

A method to purify single or specific oiigomeric recombinant El and/or E2 and/or 
10 E1/E2 proteins according to the present invention as defined above is further 
characterized as comprising the following steps: 

iysing recombinant E1 anc/cr E2 and, or E1/E2 expressing host cells, preferably 
in the presence of an SH group blocking agent, such as N-ethyimaieimide (NEM), 
and possibly a suitable detergent, preferably Empigen-BS, 
15 - recovering said HCV envelope protein by affinity purification for instance by 
means lectin-chromatography, such as lentil-lectin chromatography, or 
immunoaffinity chromatography using anti-El and/or anti-E2 specific monoclonal 
antibodies, followed by, 

reduction or cleavage of disulphide bonds with a disulphide bond cleaving agent, 
20 such as DT7, preferably also in the presence of an SH group blocking agent, 

such as NEM or Biotin-NEM, and, 

recovering the reduced HCV E1 an<i/or E2 and/or E1/E2 envelope proteins for 
instance by gelf titration (size exclusion chr ^matography or molecular sieving)*and 
possibly also by an additional Ni 2 *-IMAC chromatography and desalting step. 
25 It is to be understood that the above-mentioned recovery steps may also be 

carried out using any ether suitable technique known by the person skilled in the art. 

Preferred lectin-chromatography systems include Galanthus nivalis agglutinin 
(GNA) - chromatography, or Lens cuiinaris agglutinin (LCA) (lentil) lectin chromatography 
as illustrated in the Examples section. Other useful lectins include those recognizing 
30 high-mannose type sugars, such as Narcissus oseudonarcissus agglutinin (NPA), Pisum 
sativum agglutinin (PSA), or Allium ursinum agglutinin (AUA). 

Preferably said method is* usable to purify single or specific oiigomeric HCV 
envelope protein produced intracellular as detailed above. 

For secreted E1 or E2 or E1/E2 oligomers, lectins binding complex sugars such 
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as Hicinus communis agglutinin I {RCA I), are preferred learns. 

The present invention*- more particularly contemplates essentially purified 
recombinant HCV single or specific oligomeric envelope proteins, selected from the 
group consisting of El and/or E2 and/or E1/E2, characterized as being isolated or 
5 purified by a method as defined above. 

The present invention more particularly relates to the purification or isolation of 
recombinant envelope proteins which are expressed from recombinant mammalian cells 
such as vaccinia. 

The present invention also relates to the purification or isolation of recombinant 
10 envelope proteins which are expressed from recombinant yeast cells. 

The present invention equally relates to the purification or isolation of 
recombinant envelope proteins which are expressed from recombinant bacterial 
(prokaryotic) cells. 

The present invention also contemplates a recombinant vector comprising a 

15 vector sequence, an appropriate prokaryotic, eukaryotic or viral or synthetic promoter 
sequence followed by a nucleotide sequence allowing the expression of the single or 
specific oligomeric El and/or E2 and/or E1/E2 of the invention. 

Particularly, the present invention contemplates a recombinant vector comprising 
a vector sequence, an appropriate prokaryotic, eukaryotic or viral or synthetic promoter 

20 sequence followed by a nucleotide sequence allowing the expression of the single El 
or El of the invention. 

Particularly, the present invention contemplates a recombinant vector comprising 
a vector sequence, an appropriate prokaryotic, eukaryotic or viral or synthetic promoter 
sequence followed by a nucleotide sequence allowing the expression of the single El 

25 or E2 of the invention. 

The segment of the HCV cDNA encoding the desired El and/or E2 sequence 
inserted into the vector sequence may be attached to a signal sequence. Said signal 
sequence may be that from a non-HCV source, e.g. the IgG or tissue plasminogen 
activator (tpa) leader sequence for expression in mammalian cells, or the a-mating factor 

30 sequence for expression into yeast cells, but particularly preferred constructs according 
to the present invention contain signal sequences appearing in the HCV genome before 
the respective start points of the El and E2 proteins. The segment of the HCV cDNA 
encoding the desired El and/or E2 sequence inserted into the vector may also include 
deletions e.g. of the hydrophobic domain(s) as illustrated in the examples section, or of 
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the 52 hypervariable region L 

More particularly, the recombinant vectors according 10 the present invention 
encompass a nucieic acid having an HCV cDNA segment encoding the poiyprotein 
starting in the region between amino acid positions 1 and 1 92 and ending in the region 
5 between positions 250 and 400 of the HCV poiyprotein, more preferably ending in the 
region between positions 250 and 341, even more preferably ending in the region 
between positrons 290 and 341 for expression of the KCV single E1 protein. Most 
preferably, the present recombinant vector encompasses a recombinant nucleic acid 
having a HCV cDNA seqment encoding part of the HCV poiyprotein starting in the 

10 region between positions 117 and 192, and ending at any position in the region 
between positions 253 and 325, for expression of HCV single El protein. Also within 
the scope of the present invention are forms that have the first hydrophobic domain 
deleted (positions 264 to 2S3 plus or minus 8 amino acids), or forms to which a 5'- 
terminal ATG codon and a 3'-terminai stop codon has been added, or forms which have 

15 a factor Xa cleavage site and/or 3 to 10, preferably S Histidine codons have been 
added. 

More particularly, the recombinant vectors according to the present invention 
encompass a nucieic acid having an HCV cDNA segment encoding the poiyprotein 
starting in the region between amino acid positions 290 and 406 and ending in the 

20 region between positions 500 and 820 of the HCV poiyprotein, more preferably starting 
in the region between positions 322 and 406, even more preferably starting in the 
region between positions 347 and 406, even still more preferably starting in the region 
between positions 364 and 405 for expression of the HCV single E2 protein. Most 
preferably, the present recombinant vector encompasses a recombinant nucleic acid 

25 having a HCV cDNA seqment encoding the poiyprotein starting in the region between 
positions 290 and 406, and ending at any position of positions 523, 650, 661, 673, 
710, 715, 720, 746 or 809, for expression of HCV single E2 protein. Also within the 
scope of the present invention are forms to which a 5'-terminal ATG codon and a 3'- 
terminal stop codon has been added, or forms which have a factor Xa cleavage site 

30 and/or 3 to 10, preferably 6 Histidine codons have been added. 

A variety of vectors may be used to obtain recombinant expression of HCV single 
or specific oiigomeric envelope proteins of the present invention. Lower eukaryotes such 
as yeasts and glycosyiation mutant strains are typically transformed with piasmids, or 
are transformed with a recombinant virus. The vectors may replicate within the host 
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independently, or may integrate into the host ceil genome. 

Higher eukaryotes mayts transformed with vectors, or may be infected with a 
recombinant virus, for example a recombinant vaccinia virus. Techniques and vectors 
for the insertion of foreign DNA into vaccinia virus are well known in the art. and utilize, 
for example homologous recombination. A wide variety of viral promoter sequences, 
possibly terminator sequences and poly(A)-addition sequences, possibly enhancer 
sequences and possibly amplification sequences, all required for the mammalian 
expression, are available in the art. Vaccinia is particularly preferred since vaccinia halts 
the expression of host cell proteins. Vaccinia is also very much preferred since it allows 
the expression of E1 and 52 proteins of HCV in cells or individuals which are immunized 
with the live recombinant vaccinia virus. For vaccination of humans the avipox and 
Ankara Modified Virus (AMV) are particularly useful vectors. 

Also known are insect expression transfer vectors derived from baculovirus 
Autooraoha californica nuclear polyhidrosis virus (AcNPV), which is a helper- 
independent viral expression vector. Ex- -ssion vectors derived from this system usually 
use the strong viral polyhedrin gene p jmoter to drive the expression of heterologous 
genes. Different vectors as well as methods for the introduction of heterologous DNA 
into the desired site of baculovirus are available to the man skilled in the art for 
baculovirus expression. Also different signals for posttranslational modification 
recognized by insect cells are known in the art. 

Also included within the scope of the present invention is a method for producing 
purified recombinant single or specific oligomeric HCV El or E2 or E1/E2 proteins, 
wherein the cysteine residues involved in aggregates formation are replaced at the level 
of the nucleic acid sequence by other residues such that aggregate formation is 
prevented. The recombinant proteins expressed by recombinant vectors caarying such 
a mutated E1 and/or E2 protein encoding nucleic acid are also within the scope of the 
present invention. 

The present invention also relates to recombinant E1 and/or E2 and/or E1/E2 
proteins characterized in that at least one of their giycosylation sites has been removed 
and are consequently termed giycosylation mutants. As explained in the Examples 
section, different giycosylation mutants may be desired to diagnose (screening, 
confirmation, prognosis, etc.) and prevent HCV disease according to the patient in 
question. An E2 protein giycosylation mutant lacking the GLY4 has for instance been 
found to improve the reactivity of certain sera in diagnosis. These giycosylation mutants 
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are preferably purified according to the method disclosed in rhe present invention. Also 
contemplated within the preseht invention are recombinant vectors carrying the nucleic 
acid insert encoding such a E1 and/or £2 and/or E1/E2 glycosylate mutant as we!! as 
host cells tranformed with such a recombinant vector. 

The present invention also relates to recombinant vectors including a 
polynucleotide which also forms part of the present invention. The present invention 
relates more particularly to the recombinant nucleic acids as represented in SEQ ID NO 
3. 5, 7. 9. 11, 13. 21. 23, 25, 27, 29. 31. 35. 37, 39, 41 . 43. 45. 47 and 49, or parts 
thereof. 

The present invention also contemplates host ceils transformed with a 
recombinant vector as defined above, wherein said vector comprises a nucleotide 
sequence encoding HCV El and/or E2 and/or E1/E2 protein as defined above in addition 
to a regulatory sequence operabiy linked to said HCV E1 and/or E2 and/or E1/E2 
sequence and capable of regulating the expression of said HCV E1 and/or E2 and/or 
E1/E2 protein. 

Eukaryotic hosts include lower and higher eukaryotic hosts as described in the 
definitions section. Lower eukaryotic hosts include yeast cells well known in the art. 
Higher eukaryotic hosts mainly include mammalian eel! lines known in the art and 
include many immortalized cell lines available from the ATCC, inluding HeLa cells. 
Chinese hamster ovary (CHO) cells. Baby hamster kidney (BHK) cells, PK15, RK13 and 
a number of other cell lines. 

The present invention relates particularly to a recombinant El and/or, E2 and/or 
E1/E2 protein expressed by a host eel! as defined above containing a recombinany 
vector as defined above. Tnese recombinant proteins are particularly purified according 
to the method of the present invention. 

A preferred method for isolating or purifying HCV envelope proteins as defined 
above is further characterized as comprising at least the following steps: 

growing a host cell as defined above transformed with a recombinant vector 

according to the present invention or with a known recombinant vector 

expressing El and/or E2 and/or E1/E2 HCV envelope proteins in a suitable 

culture medium. 

causing expression of said vector sequence as defined above under suitable 
conditions, and, 

lysing said transformed host cells, preferably in the presence of a SH group 
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blocking agent, such as N-ethylmaieimide (NEMl. and possibiv a suitable 
detergent, preferably Emptgen-BE. 

recovering said KCV envelope protein by affinity purification such as by means 
of lectin-chromatography or immunoaffinity chromatography using anti-El and/or 
5 anti-E2 specific monoclonal antibodies, with said lectin being preferably lentil- 

lectin or GNA, followed by, 

incubation of the eluate of the previous step with a disufphide bond cleavage 

means, such as D7T, preferably followed by incubation with an SH group 

blocking agent, such as NEM or Biatin-NEM, and, 
10 - isolating the HCV single or specific oligomeric El and/or E2 and/or E1/E2 

proteins such as by means of geifiitration and possibly also by a subsequent 

Ni 2 *-IMAC chromatography followed by a desalting step. 

As a result of the above-mentioned prccss. El and/or E2 and/Or E1/E2 proteins 
may be produced in a form which elute differently from the large aggregates containing 
1 5 vector-derived components and/or cell components in the void volume of the geifiitration 
column or the IMAC collumn as illustrated in the Examples section. The disufphide 
bridge cleavage step advantageously also eliminates the false reactivity due to the 
presence of host and/or expression-system-derived proteins. The presence of NEM and 
a suitable detergent during lysis of the cells may already partly or even completely 
20 prevent the aggregation between the HCV envelope proteins and contaminants. 

Ni 2 ~-IMAC chromatography followed by a desalting step is preferably used for 
contructs bearing a (His) 5 as described by Janknecht et al., 1991, and Hochuli et al., 
1988. 

The present invention also relates to a method for producing monoclonal 
25 antibodies in small animals such as mice or rats, as well as a method for screening and 
isolating human B-cells that recognize anti-HCV antibodies, using the HCV single or 
specific oligomeric envelope proteins of the present invention. 

The present invention further relates to a composition comprising at least one of 
the following E1 peptides as listed in Table 3: 
30 El -3 1 (SEQ ID NO 55) spanning amino acids 181 to 200 of the Core/El VI 

region, 

El -3 3 (SEQ ID NO 57} spanning amino acids 1 93 to 21 2 of the El region. 
El -35 (SEQ ID NO 58] spanning amino acids 205 to 224 of the E1 V2 region 
(epitope B), 
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E1-35A (SEQ ID NO 59) spanning amino acids 208 to 227 of the El V2 region 
(epitope 3), 2 * 

IbEl (SEQ ID NO 53) spanning amino acids 192 to 223 of El regions (V1, CI, 
and V2 regions (containing epitope B)), 
5 El -5 1 (SEQ ID NO 66) spanning amino acids 301 to 320 of the El region, 

El -53 (SEQ ID NO 67) spanning amino acids 313 to 332 of the E1 C4 region 
(epitope A), 

E1-55 (SEQ ID NO 63) spanning amino acids 325 to 344 of the El region. 
The present invention also relates to a composition comprising at least one of the 
10 following E2 peptides as fisted in Table 3: 

Env 67 or E2-67 (SEQ ID NO 72) spanning amino acid positions 397 to 416 of 
the E2 region (epitope A, recognized by mcnocional antibody 2F10K10, see 
Figure 19), 

Env 69 or E2-69 (SEQ ID NO 73) spanning amino acid positions 409 to 423 of 
15 the E2 region (epitope A), 

Env 23 or E2-23 (SEQ ID NO 86) spanning positions 533 to 602 of the E2 region 
(epitope E), 

Env 25 or E2-25 (SEQ ID NO 87) spanning positions- 595 to 614 of the E2 region 
(epitope EL 

20 Env 27 or E2-27 (SEQ ID NO 88) spanning positions 607 to 626 of the E2 region 

(epitope E), 

Env 17B or E2-17B (SEQ ID NO 83) spanning positions 547 to 565. of the E2 
region (epitope D), 

Env 13B or -E2-1 38 (SEQ ID NO 82) spanning positions 523 to 542 of the E2 
25 region (epitope C; recognized by monoclonal antibody 1 6A6E7, see Figure 1 9). 

The present invention also relates to a composition comprising at least one of the 
following E2 conformational epitopes: 

epitope F recognized by monoclonal antibodies 15C8C1, 12D11F1 and 
8G10D1H9, 

30 epitope G recognized by monoclonal antibody 9G3E5, 

epitope H (or C) recognized by monoclonal antibody 10D3C4 and 4H6B2, or, 
epitope I recognized by monoclonal antibody 1 7F2C2. 

The present invention also relates to an E1 or E2 specific antibody raised upon 
immunization with a peptide or protein composition, with said antibody being specifically 
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reactive with any of the polypeptides or peptides as defined above, and with said 
antibody being preferably a monoclonal antibody. 

The present invention also relates :o an El or E2 specific antibody screened from 
a variable chain library in plasmids or phages or from a population of human B-cells by 
means of a process known in the art. with said antibody being reactive with any of the 
polypeptides or peptides as defined above, and with said antibody being preferably^ 
monoclonal antibody. 

The El or E2 specific monocionai antibodies of the invention can be produced 
by any hybridoma liable to be formed according to classical methods from splenic cells 
of an animal, particularly from a mouse or rat. immunized against the HCV polypeptides 
or peptides according to the invention, as defined above on the one hand, and of cells 
of a myeloma cell line on the other hand, and to be selected by the ability of the 
hybridoma to produce the monoclonal antibodies recognizing the polypeptides which has 
been initially used for the immunization of the animals. 

The antibodies involved in the invention can be labelled by an appropriate label 
of the enzymatic, fluorescent, or radioactive type. 

The monocionai antibodies according to this preferred embodiment of the 
invention may be humanized versions of mouse monoclonal antibodies made by means 
of recombinant DNA technology, departing from parts of mouse and/or human genomic 
DNA sequences coding for H and L chains from cDNA or genomic clones coding for H 
and L chains. 

Alternatively the monoclonal antibodies according to this preferred embodiment 
of the invention may be human monocionai antibodies. These antibodies according to 
the present embodiment of the invention can also be derived from human peripheral 
blood lymphocytes of patients infected with HCV, or vaccinated against HCV. Such 
human monoclonal antibodies are prepared, for instance, by means of human peripheral 
blood lymphocytes (PBL) repopulation of severe combined immune deficiency (SCID) 
mice {for recent review, see Duchosal et ai., 19S2). 

The invention also relates to the use of the proteins or peptides of the invention, 
for the selection of recombinant antibodies by the process of repertoire cloning (Persson 
eta!.. 1991 J. 

Antibodies directed to peptides or single or specific oligomeric envelope proteins 
derived from a certain genotype may be used as a medicament, more particularly for 
incorporation into an immunoassay for the detection of HCV genotypes (for detecting 
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the presence of HCV El or E2 antigen), for prognosing/monitoring of HCV disease, or 
as therapeutic agents. 

Alternatively, the present invention also relates to the use of any of the above- 
specified El or E2 specific monoclonal antibodies for the preparation of an immunoassay 
kit for detecting the presence of El or E2 antigen in a biological sample, for the 
preparation of a kit for prognosing/monitoring of HCV disease or for the preparation of 
a HCV medicament. 

The present invention also relates to the a method for in vitro diagnosis or 
detection of HCV antigen present in a biological sample, comprising at least the 
following steps : 

(i) contacting said biological sample with any of the E1 and/or E2 specific 
monoclonal antibodies as defined above, preferably in an immobilized 
form under appropriate conditions which allow the formation of an 
immune complex, 

(ii) removing unbound components, 

(iii) incubating the immune complexes formed with heterologous antibodies, 
which specifically bind to the antibodies present in the sample to be 
analyzed, with said heterologous antibodies having conjugated to a 
detectable label under appropriate conditions, 

(iv) detecting the presence of said immune complexes visually or 
mechanically (e.g. by means of densitometry, fluorimetry, colorimetry). 

The present invention also relates to a kit for in vitro diagnosis of HCV antigen 
present in a biological sample, comprising: 

at least one monoclonal antibody as defined above, with said antibody 
being preferentially immobilized on a solid substrate, 
a buffer or components necessary for producing the buffer enabling 
binding reaction between these antibodies and the HCV antigens present 
in the biological sample, 

a means for detecting the immune complexes formed in the preceding 
binding reaction, 

possibly also including an automated scanning and interpretation device 
for inferring the HCV antigens present in the sample from the observed 
binding pattern. 

The present invention also relates to a composition comprising El and/or E2 
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and/or El . £2 recombinant HCV proteins purified according to the method of the present 
invention or a composition comprising at feast one peptides as specified above for use 
as a medicament. 

The present invention more particularly relates to a composition comprising at 
least one of the above-specified envelope peptides or a recombinant envelope protein 
composition as defined above, for use as a vaccine for immunizing a mammal, 
preferably humans, against HCV, comprising administering a sufficient amount of the 
composition possibly accompanied by pharmaceutical^ acceptable adjuvants),, to 
produce an immune response. 

More particularly, the present invention relates to the use of any of the 
compositions as described here above for the preparation of a vaccine as described 
above. 

Also, the present invention relates to a vaccine composition for immunizing a 
mammal, preferably humans, against HCV, comprising HCV single or specific oligomeric 
proteins or peptides derived from the El and/or the E2 region as described above. 

Immunogenic compositions can be prepared according to methods known in the 
ar L . The present compositions comprise an immunogenic amount of a recombinant El 
and/or E2 and/or E1/E2 single or specific oligomeric proteins as defined above or El or 
E2 peptides as defined above, usually combined with a pharmaceutical^ acceptable 
carrier, preferably further comprising an adjuvant. 

Tne single or specific oligomeric envelope proteins of the present invention, either 
El and/or E2 and/or El /E2, are expected to provide a particularly useful vaccine antigen, 
since the formation of antibodies to either El or E2 may be more desirable than to the 
other envelope protein, and since the E2 protein is cross-reactive between HCV types 
and the E1 protein is type-specific. Cocktails including type 1 E2 protein and E1 proteins 
derived from several genotypes may be particularly advantageous. Cocktails containing 
a molar excess of E1 versus E2 or E2 versus E1 rray also be particularly useful. 
Immunogenic compositions may be administered to animals to induce production of 
antibodies, either to provide a source of antibodies or to induce protective immunity in 
the animal. 

Pharmaceutical^ acceptable carriers include any carrier that does not itself 
induce the production of antibodies harmful to the individual receiving the composition. 
Suitable carriers are typically large, slowly metabolized macromolecules such as 
proteins, polysaccharides, polylactic acids, polygiycolic acids, polymeric amino acids. 
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amino acid copolymers; and inactive virus particles. Such earners are well know; 
these of ordinary skill in the art. 

Preferred adjuvants to enhance effectiveness of the composition incfude, but are 
not limited to : aluminim hydroxide (aium), N-acetyI-muramyi-L-threony[-D-isoglutamine 
(thr-MDP) as found in U.S. Patent No. 4.506,913, N-acetyl-normuramyl-L-alanyl-D- 
isogiutamine (nor-MDP), N-acstylmuramy!-L-alanyl-D-isoglutaminyl-L-aIanine-2-(r-2'- 
dipalmitoyf-sn-glYcero-3-hydroxyphosphoryloxy)-ethylarnine (MTP-PE) and R1BI, which 
contains three components extracted from bacteria, monophosphoryi lipid A. trehalose 
dimycoiate, and cell wall skeleton (MPL-rTDM + CWS) in a 2% squalene.Tween 80 
emulsion. Any of the 3 components MPL, TDM or CWS may also be used alone or 
combined 2 by 2. Additionally, adjuvants such as Stimulon (Cambridge Bioscience, 
Worcester, MA) or SAF-1 (Syntex) may be used. Further, Complete Freund's Adjuvant 
(CPA) and Incomplete Freund's Adjuvant UFA) may be used for non-human applications 
and research purposes. 

The immunogenic compositions typically will contain pharmaceutical! y acceptable 
vehicles, such as water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, 
such as wetting or emulsifying agents, pH buffering substances, preservatives, and the 
like, may be included in such vehicles. 

Typically, the immunogenic compositions are prepared as injectabies, either as 
liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, 
liquid vehicles prior to injection may also be prepared. The preparation also may be 
emuisified or encapsulated in liposomes for enhanced adjuvant effect. The El and E2 
proteins may also be incorporated into Immune Stimulating Complexes together with 
saponins, for example Quil A (ISCOMSh 

Immunogenic compositions used as vaccines comprise a 'sufficient amount' or 
'an immunologically effective amount' of the envelope proteins of the present invention, 
as well as any other of the above mentioned components, as needed. 'Immunologically 
effective amount', means that the administration of that amount to an individual, either 
in a single dose or as part of a series, is effective for treatment, as defined above. This 
amount varies depending upon the health and physical condition of the individual to be 
treated, the taxonomic group of individual to be treated (e.g. nonhuman primate, 
primate, etc.), the capacity of the individual's immune system to synthesize antibodies, 
the degree of protection desired, the formulation of the vaccine, the treating doctor's 
assessment of the medical situation, the strain of infecting HCV, and other relevant 
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factors, it is expected that the amount will fail in a relatively broad range that can be 
determined through routine trials. Usually, the amount will vary from 0.01 to 1000 
jjq/dosa, more particularly from 0.1 to 100 jt/g/dcse. 

The single or specific oiigomeric envelope proteins may also serve as vaccine 
carriers to present homologous (e.g. T ceil epitopes or 8 ceil epitopes from the core, 
NS2, NS3, NS4 or NS5 regions) or heterologous (non-HCV) haptens, in the same 
manner as Hepatitis 3 surface antigen (see European Patent Application 1 74,444). in 
this use, envelope proteins provide an immunogenic carrier capable of stimulating an 
immune response to haptens or antigens conjugated to the aggregate. The antigen may 
be conjugated either by conventional chemical methods, or may be cloned into the gene 
encoding E1 and/or E2 at a location corresponding to a hydrophilic region of the protein. 
Such hydrophylic regions include the VI region (encompassing amino acid positions 1 9 1 
to 202), the V2 region (encompassing amino acid positions 21 3 to 223), the V3 recion 
(encompassing amino acid positions 230 to 2*2). the V4 region (encompassing amino 
acid positions 230 to 242), the V5 region (encompassing amino acid positions 294 to 
303) and the VS region (encompassing amino acid positions 329 to 336). Another 
useful location for insertion of haptens is the hydrophobic region (encompassing 
approximately amino acid positions 264 to 293). it is shown in the present invention 
that this region can be deleted without affecting the reactivity of the deleted E1 protein 
with antisera. Therefore, haptens may be inserted at the site of the deletion. 

The immunogenic compositions are conventionally administered parenterally, 
typically by injection, for example, subcutaneousiy or intramuscularly. Additional 
formulations suitable for other methods of administration include oral formulations and 
suppositories. Dosage treatment may be a single dose schedule or a multiple dose 
schedule. The vaccine may be administered in conjunction with other immunoregufatory 
agents. 

The present invention also relates to a composition comprising peptides or 
polypeptides as described above, for in vitro detection of HCV antibodies present in a 
biological sample. 

The present invention also relates to the use of a composition as described above 
for the preparation of an immunoassay kit for detecting HCV antibodies present in a 
biological sample. 

The present invention also relates to a method for in vitro diagnosis of HCV 
antibodies present in a biological sample, comprising at least the following steps : 
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(i) contacting said biological sample with a composition comprising any of 
the envelope peptide or proteins as defined above, preferably in an 
immobilized form under appropriate conditions which ailo w the formation 
of an immune complex, wherein said peptide or protein can be a 
biotinylated peptide or protein which is covaientfy bound to a solid 
substrate by means of streptavidin or avidin complexes, 

(ii) removing unbound components, 

(iii) incubating the immune complexes formed with heterologous antibodies, 
with said heterologous antibodies having conjugated to a detectable label 
under appropriate conditions, 

(iv) detecting the presence of said immune complexes visually or 
mechanically (e.g. by means of densitometry, fiuorimetry, colorimetry). 

Alternatively, the present invention also relates to competition immunoassay 
formats in which recombinantiy produced purified single or specific oligomeric protein 
El and/or E2 and/or E1/E2 proteins as disclosed above are used in combination with E1 
and/or E2 peptides in order to compete for HCV antibodies present in a biological 
sample. 

The present invention also relates to a kit for determining the presence of HCV 
antibodies, in a biological sample, comprising : 

at least one peptide or protein composition as defined above, possibiy in 
combination with other polypeptides or peptides from HCV or other types 
of HCV, with said peptides or proteins being preferentially immobilized on 
a solid substrate, more preferably on different microwells of the same 
ELISA plate, and even more preferentially on one and the same membrane 
strip, 

a buffer or components necessary for producing the buffer enabling 
binding reaction between these polypeptides or peptides and the 
antibodies against HCV present in the biological sample, 
means for detecting the immune complexes formed in the preceding 
binding reaction, 

possibly also including an automated scanning and interpretation device 
for inferring the HCV genotypes present in the sample from the observed 
binding pattern. 

Tne immunoassay methods according to the present invention utilize single or 
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specific oiigomeric antigens from the 51 and/or E2 domains that maintain linear {in case 
of peptides) and conformational epitopes (single or specific oiigomeric proteins) 
recognized by antibodies in the sera from individuals infected with HCV. It is within the 
scope of the invention to use for instance single or specific oiigomeric antigens, dimeric 
5 antigens, as well as combinations of single or specific oiigomeric antigens. The HCV El 
and E2 antigens of the present invention may be employed in virtually any assay format 
that employs a known antigen to detect antibodies. Of course, a format that denatures 
the HCV. conformational epitope should be avoided or adapted. A common feature of 
all of these assays is that the antigen is contacted with the body component suspected 
10 of containing HCV antibodies under conditions that permit the antigen to bind to any 
such antibody present in the component. Such conditions will typically be physiologic 
temperature, pH and ionic strenght using an excess of antigen. The incubation of the 
antigen with the specimen is followed by detection of immune complexes comprised of 
the antigen. 

15 Design of the immunoassays is subject to a great deal of variation, and many 

formats are known in the art. Protocols may, for example, use solid supports, or 
immunoprecipitation. Most assays involve the use of labeled antibody or polypeptide; 
the labels may be, for example, enzymatic, fluorescent, chemiluminescent, radioactive, 
or dye molecules. Assays which amplify the signals from the immune complex are also 

20 known; examples of which are assays which utilize biotin and avidin or streptavidin, and 
enzyme-labeled and mediated immunoassays, such as 5LISA assays. 

The immunoassay may be, without limitation, in a heterogeneous or in a 
homogeneous format, and of a standard or competitive type. In a heterogeneous format, 
the polypeptide is typically bound to a solid matrix or support to facilitate separation of 

25 the sample from the polypeptide after incubation. Examples of solid supports that can 
be used are nitrocellulose (e.g., in membrane or microtiter well form), polyvinyl chloride 
(e.g., in sheets or microtiter wells), polystyrene latex (e.g., in beads or microtiter plates, 
poiyvinylidine fluoride (known as Immunolon™), diazotized paper, nylon membranes, 
activated beads, and Protein A beads. For example, Dynatech immunolon™ 1 or 

30 Immunlon™ 2 microtiter plates or 0.25 inch polystyrene beads (Precision Plastic Ball) 
can be used in the heterogeneous format. The solid support containing the antigenic 
polypeptides is typically washed after separating it from the test sample, and prior to 
detection of bound antibodies. Both standard and competitive formats are know in the 
art. 
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in a homogeneous forms:, the res: Sample is incubated with the combination of 
antigens in solution. For example, it may be under conditions that will precipitate any 
antigen-antibody complexes which are formed. Both standard and competitive formats 
for these assays are known in the art. 
5 In a standard format, the amount of HCV antibodies in the antibody-antigen 

complexes is directly monitored. This may be accomplished by determining whether 
labeled anti-xenogeneic {e.g. anti-human) antibodies which recognize an epitope on anti- 
HCV antibodies will bind due to complex formation. In a competitive format, the amount 
of HCV antibodies in the sample is deduced by monitoring the competitive effect on the 
10 binding of a known amount of labeled antibody (or other competing ligand) in the 
complex. 

Complexes formed comprising anti-KCV antibody (or in the case of competitive 
assays, the amount of competing antibody) are detected by any of a number of known 
techniques, depending on the format. For example, unlabeled HCV antibodies in the 

15 complex may be detected using a conjugate of anti-xenogeneic Ig complexed with a 
tabe! {e.g. an enzyme label). 

In an immunoprecipitation or aggiutination assay format the reaction between the 
HCV antigens and the antibody forms a network that precipitates from the solution or 
suspension and forms a visible layer or film of precipitate. If no anti-HCV antibody is 

20 present in the test specimen, no visible precipitate is formed. 

There currently exist three specific types of particle agglutination (PA) assays. 
These assays are used for the detection of antibodies to various antigens when coated 
to a support. One type of this assay is the hemagglutination assay using red blood cells 
(RBCs) that are sensitized by passively adsorbing antigen (or antibody) to the RBC. The 

25 addition of specific antigen antibodies present in the body component, if any, causes 
the RBCs coated with the purified antigen to agglutinate. 

To eliminate potential non-specific reactions in the hemagglutination assay, two 
artificial carriers may be used instead of RBC in the PA. The most common of these are 
latex particles. However, gelatin particles may also be used. The assays utilizing either 

30 of these carriers are based on passive agglutination of the particles coated with purified 
antigens. 

The HCV single or specififc oiigomeric E1 and/or E2 and/or E1/E2 antigens of the 
present invention comprised of conformational epitopes will typically be packaged in the 
form of a kit for use in these immunoassays. The kit will normally contain in separate 
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containers the native HCV antigen, control antibody formulations (positive and/or 
negative), labeled antibody when the assay format requires the same and signal 
generating reagents (e.g. enzyme substrate) if the label does not generate a signal 
directly. The native HCV antigen may be already bound to a solid matrix or separate 
with reagents for binding it to the matrix. Instructions (e.g. written, tape, CD-ROM, etc.) 
for carrying out the assay usually will be included in the kit. 

Immunoassays that utilize the native HCV antigen are useful in screening blood 
for the preparation of a supply from which potentially infective HCV is lacking. The 
method for the preparation of the blood supply comprises the following steps. Reacting 
a body component, preferably biood or a bicod component, from the individual donating 
biood with HCV E1 and/or E2 proteins of the present invention to allow an 
immunological reaction between HCV antibodies, if any, and the HCV antigen. Detecting 
whether anti-HCV antibody - HCV antigen complexes are formed as a result of the 
reacting. Blood contributed to the blood supply is from donors that do net exhibit 
antibodies to the native HCV antigens. El or E2. 

In cases of a positive reactivity to the HCV antigen, it is preferable to repeat the 
immunoassay to lessen the possibility of false positives. For example, in the large scale 
screening of blood for the production of blood products (e.g. biood transfusion, plasma. 
Factor VIII, immunoglobulin, etc.) 'screening' tests are typically formatted to increase 
sensitivity (to insure no contaminated blood passes! at the expense of specificity: i.e. 
the false-positive rate is increased. Thus, it is typical to only defer for further testing 
those donors who are 'repeatedly reactive'; i.e. positive in two or more runs of the 
immunoassay on the donated sample. However, for confirmation of HCV-positivity, the 
'confirmation' tests are typically formatted to increase specificity (to insure that no 
false-positive samples are confirmed) at the expense of sensitivity. Therefore the 
purification method described in the present invention for E1 and E2 wiir be very 
advantageous for including singie or specific oiigomeric envelope proteins into HCV 
diagnostic assays. 

The solid phase selected can include polymeric or glass beads, nitrocellulose, 
microparticies, microwells of a reaction tray, test tubes and magnetic beads. The signal 
generating compound can include an enzyme, a luminescent compound, a chromogen, 
a radioactive element and a chemiluminescent compound. Examples of enzymes include 
alkaline phosphatase, horseradish peroxidase and beta-galactosidase. Examples of 
enhancer compounds include biotin. anti-biotin and avidin. Examples of enhancer 
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compounds binding members include biotin. anti-biotm and avidin. In order to block the 
effects of rheumatoid factor-liRe substances, the test sample is subjected to conditions 
sufficient to block the effect of rheumatoid factor-like substances. These conditions 
comprise contacting the test sample with a quantity of anti-human igG to form a 
mixture, and incubating the mixture for a time and under conditions sufficient to form 
a reaction mixture product substantially free of rheumatoid factor-like substance. 

The present invention further contemplates the use of E1 proteins, or parts 
thereof, more particularly HCV single or specific oligomeric E1 proteins as defined 
above, for in vitro monitoring HCV disease or prognosing the response to treatment (for 
instance with interferon) of patients suffering from HCV infection comprising: 

incubating a biological sample from a patient with hepatitis C infection 

with an E1 protein or a suitable oar: thereof under conditions allowing the 

formation of an immunological complex, 

removing unbound components, 

calculating the anti-El titers present in said sample {for example at the 
start of and/or during the course of (interferon) therapy), 
monitoring the natural course of HCV disease, or prognosing the response 
to treatment of said patient on the basis of the amount anti-El titers 
found in said sample at the start of treatment and/or during the course of 
treatment. 

Patients who show a decrease of 2, 3, 4, 5, 7, 1 0, 1 5, or preferably more than 
20 times of the initial anti-E1 titers could be concluded to be long-term, sustained 
responders to HCV therapy, more particularly to interferon therapy. It is illustrated in the 
Examples section, that an anti-El assay may be very useful for prognosing long-term 
response to IFN treatment, or to treatment of Hepatitis C virus disease in general. 

More particularly the following E1 peptides as listed in Table 3 were found to be 
useful for in vitro monitoring HCV disease or prognosing the response to interferon 
treatment of patients suffering from HCV infection: 

El -31 (SEQ ID NO 56) spanning amino acids 181 to 200 of the Core/E1 V1 

region, 

E1-33 (SEQ ID NO 57) spanning amino acids 193 to 212 of the E1 region. 
E1-35 (SEQ ID NO 58) spanning amino acids 205 to 224 of the E1 V2 region 
(epitope B), 

E1-35A (SEQ ID NO 59) spanning amino acids 208 to 227 of the E1 V2 region 
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(epitope B), 

IbEl (S5Q ID NO 53) spanning amino acids 192 to 228 of E1 regions (VI, CI, 
and V2 regions (containing epitope B)), 

El-51 (SEQ ID NO 65) spanning amino acids 301 to 320 of the E1 region. 
E1-53 (SEQ ID NO 67) spanning amino acids 313 to 332 of the E1 C4 region 
(epitope A), 

El -55 (SEQ ID NO 53) spanning amino acids 325 to 344 of the El region. 

It is to be understood that smaller fragments of the above-mentioned peptides 
also fall within the scope of the present invention. Said smaller fragments can be easily 
prepared by chemical synthesis and can be tested for their ability to be used in an assay 
as detailed above and in the Examples section. 

The present invention also relates to a kit for monitoring HCV disease or 
prognosing the response to treatment (for instance to interferon) of patients suffering 
from HCV infection comprising: 

at least one E1 protein or El peptide, more particularly an E1 protein or 
El peptide as defined above, 

a buffer or components necessary for producing the buffer enabling the 
binding reaction between these proteins or peptides and the anti-El 
antibodies present in a biological sample, 

means for detecting the immune complexes formed in the preceding 
binding reaction, 

possibly also an automated scanning and interpretation device for 
inferring a decrease of anti-E1 titers during the progression of treatment. 

It is to be understood that also E2 protein and peptides according to the present 
invention can be used to a certain degree to monitor/prognose HCV treatment as 
indicated above for the El proteins or peptides because also the anti-E2 levels decrease 
in comparison to antibodies to the other HCV antigens. It is to be understood, however, 
that it might be possible to determine certain epitopes in the E2 region which would also 
be suited for use in an test for monitoring/prognosing HCV disease. 

The present invention also relates to a serotyping assay for detecting one or 
more serological types of HCV present in a biological sample, more particularly for 
detecting antibodies of the different types of HCV to be detected combined in one assay 
format, comprising at least the following steps : 

(i) contacting the biological sample to be analyzed for the presence of HCV 
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antibodies of one or more serological types, with at least one of the 51 
and/or E2 and/o^El /E2 protein compositions or at least one of the El or 
E2 peptide compositions as defined above, preferantiaily in an 
immobilized form under appropriate conditions which allow the formation 
of an immune complex. 



(ii) removing unbound components, 

(Hi) incubating the immune complexes formed with heterologous antibodies, 
with said heterologous antibodies being conjugated to a detectable label 
under appropriate conditions, 

(iv) detecting the presence of said immune complexes visually or 
mechanically (e.g. by means of densitometry, fiuorimetry, coiorimetry) 
and inferring the presence of one or mere HCV serological types present 
from the observed binding pattern. 

It is to be understood that the compositions of proteins or peptides used in this 
method are recombinant^ expressed type-specific envelope' proteins or type-specific 
peptides. 

The present invention further relates to a kit for serotyping one or more 
serological types of HCV present in a biological sample, more particularly for detecting 
the antibodies to these serological types of HCV comprising: 

at least one El and/or E2 and/or E1/E2 protein or El or E2 peptide, as 

defined above, 

a buffer or components necessary for producing the buffer enabling the 
binding reaction between these proteins or peptides and the anti-El 
antibodies present in a biological sample, 

means for detecting the immune complexes formed in the preceding 
binding reaction, 

possibly also an automated scanning and interpretation device for 
detecting the presence of one or more serological types present from the 
observed binding pattern. 
The present invention also relates to the use of a peptide or protein composition 
as defined above, for immobilization on a solid substrate and incorporation into a 
reversed phase hybridization assay, preferably for immobilization as parallel lines onto 
a solid support such as a membrane strip, for determining the presence or the genotype 
of HCV according to a method as defined above. Combination with other type-specific 
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antigens from other HCV polyprotein regions also lies within the scope of the present 
invention. '• 
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Figure 15 
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Figure 1 8 : 
Figure 1 9 
Figure 20: 
Figure 21 : 



Figure 22: 



Figure 23: 



Restriction map of piasmid pgpr ATA 13 

Restriction map of piasmid pgs ATA 18 

Restriction map of piasmid pMS 66 

Restriction map of piasmid pv HCV-1 1 A 

Anti-E1 leve f s in non-responders to IFN treatment 

Anti-El levels in responders to (FN treatment 

Anti-E1 leveis in patients with complete response to IFN treatment 

Anti-El levels in incomplete responders co IFN treatment 

Anti-E2 levefs in non-responders to IFN treatment 

Anti-E2 leveis in responders to IFN treatment 

Anti-E2 levels in incomplete responcers to IFN treatment 

Anti-E2 levels in comclete resoonders to IFN treatment 

Human anti-El reactivity competed with peptides 

Competition of reactivity of anti-El monoclonal antibodies with peptides 

Anti-El (epitope 1) levels in non-responders to IFN treatment 

Anti-E1 (epitope 1) levels in resoonders to IFN treatment 

Anti-E1 (epitope 2) leveis in non-responders to IFN treatment 

Anti-E1 (epitope 2) levels in responders to IFN treatment 

Competition of reactivity of anti-E2 monoclonal antibodies with peptides 

Human anti-52 reactivity competed with peptides 

Nucleic acid sequences of the present invention. The nucleic acid 
sequences encoding an El or E2 protein according to the present 
invention may be translated (SEQ ID NO 3 to 13, 21-31, 35 and 41-49 
are translated in a reading frame starting from residue number 1, SEQ ID 
NO 37-39 are translated in a reading frame starting from residue number 
2), into the amino acid sequences of the respective El or E2 proteins as 
shown in the sequence listing. 

EUSA results obtained from lentil lectin chromatography eluate fractions 
of 4 different E 1 purifications of cell lysates infected with vvHCV39 (type 
1b), wHCV40 (type 1b), wHCV62 .zype 3a), and vvHCV63 (type 5a). 
Eiution profiles obtained from the lentil lectin chromatography of the 4 
different E1 constructs on the basis of the values as shown in Figure 22. 
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Figure 24: ELISA results obtained from fractions obtained after gelfiitration 
chromatography v of 4 different El purifications of cell iysates infected 
with vvHCV39 (type lb), vvHCV^o (type lb), vvHCV62 (type 3a), and 
vvHCV63 (type 5a). 

5 Figure 25: Profiles obtained from purifications of El proteins of type 1 b (1), type 3a 
(2), and type 5a (3) (from RKT3 cells infected with vvHCV39, vvHCV62, 
and vvHCV63, respectively; purified on lentil lectin and reduced as in 
example 5.2 - 5.3) and a standard W. The peaks indicated with 'V, '2\ 
and '3', represent pure El protein peaks (see Figure 24, El reactivity 

10 mainly in fractions 25 to 30). 

Figure 26: Silver staining of an SDS-PAGE as described in exampie 4 of a raw lysate 
of E1 vvHCV40 (type 1 b) (lane 1 ), poci 1 of the gelfiitration of wKCV40 
representing fractions 10 to 17 as shown in Figure 25 (lane 2), poof 2 of 
the gelfiitration of vvHCV40 representing fractions 1 3 to 25 as shown in 

15 Figure 25 (lane 3), and El pool (fractions 25 to 30) (iane 4). 

Figure 27: Streptavidine-aikaline phosphatase blot of the fractions of the gelfiitration 
of E1 constructs 39 (type 1b) and 62 (type 3a). The proteins were 
labelled with NEM-biotin. Lane 1: start gelfiitration construct 39, lane 2 
fraction 26 construct 39, iane 3: fraction 27 construct 39, lane 4 

20 fraction 28 construct 39, lane 5: fraction 29 construct 39, iane 6 

fraction 30 construct 39, lane 7 fraction 31 construct 39, lane 8 
molecular weight marker, lane 9: start gelfiitration construct 62. lane 1 0 
fraction 26 construct 62, lane 11: fraction 27 construct 62, iane 12 
fraction 28 construct 62, lane 13: fraction 29 construct 62, lane 14 

25 fraction 30 construct 62, lane 15: fraction 31 construct 62. 

Figure 28: Siver staining of an SDS-PAGE gel of the gelfiitration fractions of vvHCV- 
39 (Els, type lb) and vvHCV-62 (Els, type 3a) run under identical 
conditions as Figure 26. Lane 1: start gelfiitration construct 39, lane 2 
fraction 26 construct 39, iane 3: fraction 27 construct 39, iane 4 

30 fraction 28 construct 39, lane 5: fraction 29 construct 39, lane 6 

fraction 30 construct 39, lane 7 fraction 31 construct 39, lane 8 
molecular weight marker, lane 9: start gelfiitration construct 62, lane 1 0 
fraction 26 construct 62, lane 11: fraction 27 construct 62, lane 12 
fraction 28 construct 62, lane 13: fraction 29 construct 52, fane 14 
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fraction 30 construe: 62, lane 15: fraction 31 construct 62. 
Figure 29: Western Blot arfalysis with anti-El mouse monoclonal antibody 5E1 Al G 
giving a complete overview of the purification procedure. Lane 1: crude 
lysate, Lane 2: flow through of lentil chromagtography. Lane 3: wash 
5 with Empigen BB after lentil chromatography. Lane 4: Eluate of lentil 

chromatography, Lane 5: Flow through during concentration of the lentil 
eluate. Lane 6: Pool of E1 after Size Exclusion Chromatography 
(gelfiitration). 

Figure 30: OD 230 profile (continuous line) of the lentil lectin chromatography of E2 
1 0 protein from RK13 cells infected with vvHCV44. The dotted line 

represents the E2 reactivity as detected by ELISA (as in example 6). 
Figure 31 A: OD 2sc profile (continuous line) of the lentil-lectin gelfiitration 

chromatography E2 protein pool from RK1 3 cells infected with vvHCV44 

in which the E2 pool is apolisd immediately on the gelfiitration column 
1 5 (non-reduced cond ; tions). The dotted line represents the E2 reactivity as 

detected by ELISA (as in example 6). 
Figure 31 B: OD 220 profile (continuous line) of the lentil-lectin gelfiitration 

chromatography E2 protein pool from RK1 3 cells infected with vvHCV44 

in which the E2 pool was reduced and blocked according to Example 5.3 
20 (reduced conditions). The dotted line represents the E2 reactivity as 

detected by ELISA (as in example 6). 
Figure 32: Ni 2 *-1MAC chromatography and ELISA reactivity of the E2 protein as 

expressed from vvKCV44 after gelfiitration under reducing conditions as 

shown in Figure 31 B. 

25 Figure 33: Silver staining of an SDS-PAGE of 0.5 //g of purified E2 protein recovered 

by a 200 mM imidazole elution step (lane 2) and a 30mM imidazole wash 
(lane 1) of the Ni 2 *-1MAC chromatography as shown in Figure 32. 
Figure 34: OD profiles of a desalting step of the purified E2 protein recovered by 
200 mM immidazole as shown in Figure 33, intended to remove 

30 imidazole. 

Figure 35 A: Antibody levels to the different HCV antigens (Core 1, Core 2, E2HCVR, 
NS3) for NR and LTR followed during treatment and over a period of 6 to 
1 2 months after treatment determined by means of the LlAscan method. 
The average values are indicated by the curves with the open squares. 



WO 96/04385 PCT/EP95/0303I 

39 

Figure 35B: Antibody levels to the different HCV antigens (NS4, NS5, El and £2) for 
NR and LTR followed during treatment and over a period of 6 to 12 
months after treatment determined by means of the LIAscan method. The 
avergae vallues are indicated by the curve with the open squares. 

5 

Figure 36: Average El antibody (El Ab) and 52 antibody (E2Ab) levels in the LTR 
and NR groups. 

Figure 37: Averages El antibody (E1Ab) levels for non-responders (NR) and long 
term responders (LTR) for type lb and type 3a. 
10 Figure 38: Relative map positions of the anti-E2 monoclonal antibodies. 

Figure 39: Partial degiycosylaticn of HCV El envelope protein. The iysate of 
vvHCVl OA-infected RK13 ceils were incubated with different 
concentrations of giycosicases according to the manufacturer's 
instructions. Right panel: Glycopepticase F (PNGase F). Left panel: 
15 Endoglycosidase H (Endo H). 

Figure 40: Partial deglycosytation of HCV E2 envelope proteins. The Iysate of 
vvHCV64-infected (E2) and vvHCV41 -infected (E2s)RK13 cells were 
incubated with different concentrations of Qlycopeptidase F (PNGase F) 
according to the manufacturer's instructions. 
20 Figure 41: in vitro mutagenesis of HCV El glycoproteins. Map of the mutated 

sequences and the creation of new restriction sites. 

Figure 42A: in vitro mutagenesis of HCV El glycoprotein (part 1). First step of PCR 
amplification. 

Figure 42B: In vitro mutagensrs of HCV El glycoprotein (part 2). Overlap extension 
25 and nested PCR. 

Figure 43: In vitro mutagesesis of HCV El glycoproteins. Map of the PCR mutated 

fragments (GLY-# and OVR-£) synthesized during the first step of 

amplification. 

Figure 44A: Analysis of El glycoprotein mutants by Western blot expressed in HeLa 
30 (left) and RK13 (right) cells. Lane 1: wild type VV (vaccinia virus). Lane 

2: original El protein (vvHCV-1 OA), Lane 3: El mutant Gly-1 (vvHCV-8U. 
Lane 4: El mutant Gly-2 (vvHCV-82), Lane 5: El mutant Gly-3 (vvHCV- 
83), Lane 6: El mutant Gly-4 (vvHCV-84), Lane 7: El mutant Gly-5 
(vvHCV-85), Lane 8: E1 mutant Gly-6 (vvHCV-86)- 
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Figure 443: Analysis of £1 giycosyiation mutant vaccinia viruses by PCS 
amplification/restriction. Lane 1: El (vvHCV-lQAl. BspE L Lane 2 
E1.GLY-1 (vvHCV-81), BspE I, Lane 4; El (vvHCV-IOAj. Sac L Lane 5 
E1.GLY-2 (wHCV-82), Sac /, Lane 7: El <vvHCV-10A), Sac I, Lane 3 
E1.GLY-3 (vvHCV-83), Sac I, Lane 10: El (vvHCV-lOA), Stu L Lane 11 
E1.GLY-4 (vvKCV-8^}, Stu L Lane 13: El (vvHCV-10A), Sma I Lane 14 
E1.GLY-5 (vvHCV-85), Sma I, Lane 16: E1 (vvHCV-lOA), Stu I, Lane 17 
El.GLY-S (vvHCV-36), Stu I, Lane 3-6-9-12-15: Low Molecular 
Weight Marker, pBIuescript SK + , Msp L 

SDS polyacryiamide ge! electrophoresis of recombinant E2 expressed in 
S. cerevisiae . Innoculates were grown in leucine selective medium for 72 
hrs. and diluted 1/15 in complete medium. After 10 days of culture at 
28°C, medium samples were taken. The equivalent of 200 p\ of culture 
supernatant concentrated by speecvac was loaded on the gel. Two 
15 independent transformants were analysed. 

Figure 46: SOS polyacryiamide ge! electrophoresis of recombinant E2 expressed in 
a giycosyiation deficient S. cerevisiae mutant. Innoculae were grown in 
leucine selective medium for 72 hrs. and diluted 1/15 in complete 
medium. After 1 0 days of culture at 28 °C, medium samples were taken. 
20 The equivalent of 350 fj\ of culture supernatant, concentrated by ion 

exchange chromatography, was loaded on the gel. 
Table 1 : Features of the respective clones and primers used for amplification for 
constructing the different forms of the E1 protein as despected in 
Example 1. 

25 Table 2 : Summary of Anti-E1 tests 

Table 3 : Synthetic peptides for competition studies 
Table 4: Changes of envelope antibody levels over time. 
Table 5: Difference between LTR and NR 

Table 6: Competition experiments between murine E2 monoclonal antibodies 
30 Table 7: Primers for construction of El giycosyiation mutants 
Table 8: Analysis of E1 giycosyiation mutants by ELISA 
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Example 1: Cloning and expression of the hepatitis C virus El protein 
1. Construction of vaccinia virus recombination vectors 



5 The pgptATAI 8 vaccinia recombination plasmid is a modified version of pATAI 3 

(Stunnenberg et al, 1988) with an additional insertion containing the E. coli xanthine 
guanine phosphoribosyl transferase gene under the control of the vaccinia virus 13 
intermediate promoter (Figure 1 ). The piasmid pgsATAl 3 was constructed by inserting 
an oligonucleotide linker with SEQ ID NO 1/94, containing stop codons in the three 
1 0 reading frames, into the Pst I and Hindlil-cut pATAI 8 vector. This created an extra Pac 
I restriction site {Figure 2). The original Hindlli site was not restored. 



Oligonucleotide linker with SEQ ID NO 1/94; 

5' G GCATGC AAGCTT AATTAATT 3' 

15 3' ACG7C CGTACG TTCGAA T7AATTAA 7C3A E ' 



?szi sph: Kind::: ?ac i {Hir.i:::: 



In order to facilitate rapid and efficient purification by means of Ni 2 ~ chelation 
20 of engineered histidine stretches fused to the recombinant proteins, the vaccinia 
recombination vector pMSco was designed to express secreted proteins with an 
additional carboxy-terrninal histidine tag. An oligonucleotide linker with SEQ ID NO 2/95, 
containing unique sites for 3 restriction enzymes generating blunt ends (Sma I, Stu I and 
Pml !/Bbr PI) was synthesized in such a way that the carboxy-terminar end of^any cDN A 
25 couid be inserted in frame with a sequence encoding the protease factor Xa cleavage 
site followed by a nucleotide sequence encoding 6 histidines and 2 stop codons (a new 
Pac 1 restriction site was also created downstream the 3'end). This oligonucleotide with 
SEQ ID NO 2/95 was introduced between the Xma 1 and Pst I sites of pgptATAI 8 
{Figure 3). 

30 Oligonucleotide linker with SEQ ID NO 2/95: 

'5' CCGGG GAQGCCIG CACGTGA7CGAGGG CAGACAC CATCAC GAC CA7C AC7AA7AG77AA77AA CTGZAJ 
3 ' C CTCCGGACG7GCAC7AGC7CCCG7C7G7GG7AG7GG7GG7AG7GA77A7CAA7TAA77 G 



Xmal 
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Example 2. Construction of HCV recombinant olasmids 

2.1. Constructs encoding different forms of the E1 protein 

Polymerase Chain Reaction (PCR) products were derived from the serum samples 
by RNA preparation and subsequent reverse-transcription and PCR as described 
previously (Stuyver et a!., 1 993b). Table 1 shows the features of the respective clones 
and the primers used for amplification. The PCR fragments were cioned into the Sma 
t-cut pSP72 (Promega) piasmids. The following clones were selected for insertion into 
vaccinia reombination vectors: HCCI9A (SEQ ID NO 3), HCCITOA (SEQ ID NO 5), 
HCCI11A (SEQ ID NO 7), HCCI12A (SEQ ID NO 9}, HCC113A (SEQ ID NO 11), and 
HCCI1 7A (SEQ ID NO 1 3) as depicted in Figure 21 . cDNA fragments containing the El - 
coding regions were cleaved by EcoRi and Kindlll restriction from the respective pSP72 
piasmids and inserted into the EcoRS/Hindlii-cut pgptATA-18 vaccinia recombination 
vector (described in example 1), downstream of the 1 1K vaccinia virus late promoter. 
The respective piasmids were designated pvHCV-9A, pvKCV-lGA, pvHCV-1 1 A, pvHCV- 
12A, pvHCV-13A and pvHCV-17A, of which pvHCV-llA is shown in Figure 4. 

2.2, Hydrophobic region E1 deletion mutants 

Clone HCC137, containing a deletion of codons Asp264 to Vai287 (nucleotides 
790 to 861, region encoding hydrophobic domain I) was generated as follows: 2 PCR 
fragments were generated from clone HCCI1 OA with primer sets HCPr52 (SEQ ID NO 
16}/HCPr107 (SEQ ID NO 19) and HCPM08 (SEQ ID NO 201/HCPR54 (SEQ ID NO 18). 
These primers are shown in Figure 21. The two PCR fragments were purified from 
agarose gel after electrophoresis and 1 ng of each fragment was used together as 
template for PCR by means of primers KCPr52 (SEQ ID NO 1 6) and HCPr54 (SEQ ID NO 
18). The resulting fragment was cioned into the Sma l-cut pSP72 vector and clones 
containing the deletion were readily identified because of the deletion of 24 codons (72 
base pairs). Piasmid pSP72HCC!37 containing clone HCC137 (SEQ ID 1 5) was selected. 
A recpmbinant vaccinia piasmid containing the full-length El cDNA lacking hydrophobic 
domain ! was constructed by inserting the HCV sequence surrounding the deletion 
(fragment cleaved by Xma ! and 3amH i from the vector pSP72-HCCI37) into the Xma 
l-Bam H 1 sites of the vaccinia piasmid pvHCV-10A. The resulting piasmid was named 
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pvHCV-37. After confirmatory sequencing, the ammo-terminal region containing the 
internal deletion was isolated from this vector pvHCV-37 (cleavage by EcoR 1 and 3stE 
ID and reinserted into the 5co Rl and Bst Eli-cut pvHCV-1 1 A plasmid. This construct 
was expected to express an E1 protein with both hydrophobic domains deleted and was 
named pvHCV-38. The El -coding region of clone HCCI38 is represented by SEQ ID NO 
23. 

As the hydrophiiic region at the E1 carboxyterminus (theoretically extending to 
around amino acids 337-340) was not completely included in construct pvHCV-38, a 
larger El region lacking hydrophobic domain I was isolated from the pvHCV-37 plasmid 
by EcoR i/Bam HI cleavage and cloned into an EcoRI/BamHI-cut pgsATA-1 3 vector. The 
resulting piasmid was named pvHCV-39 and contained clone HCCI39 (SEQ ID NO 25). 
The same fragment was cleaved from the pvHCV-37 vector by BamH I (of which the 
sticky ends were filled with Klenow DNA Polymerase I (Boehringer)) and subsequently 
by EcoR 1 (5' cohesive end). This sequence was inserted into the EcoRI and Bbr Pl-cut 
vector pMS-S5. This resulted in clone HCG40 (SEQ ID NO 27) in plasmid pvHCV-40, 
containing a 6 histidine tail at its carboxy-terminal end. 

2.3. E1 of other Qenotvoes 

Clone HCCI62 (SEQ ID NO 29) was derived from a type 3a-infected patient with 
chronic hepatitis C (serum SR36, clone 8R36-9-13, SEQ ID NO 19 in WO 94/25601/ 
and see also Stuyver et al. 1993a) and HCC163 (SEQ ID NO 31) was derived from a 
type 5a-infected child with post-transfusion hepatitis (serum BE95, clone PC-4-1, SEQ 
ID NO 45 in WO 94/25601). 

2.4. E2 constructs 

The HCV E2 PCR fragment 22 was obtained from serum BE1 1 {genotype 1 b) by 
means of primers HCPrl09 (SEQ ID NO 33) and KCPr72 (SEQ ID NO 34) using 
techniques of RNA preparation, reverse-transcription and PCR, as described in Stuyver 
et al., 1993b, and the fragment was cloned into the Sma l-cut pSP72 vector. Clone 
HCCI22A (SEQ ID NO 35) was cut with Ncoi/AlwNi or by BamHi/AlwNi and the sticky 
ends of the fragments were blunted (Ncol and BamHi sites with Klenow DNA 
Polymerase 1 (Boehringer), and AiwNI with T4 DNA polymerase (Boehringer)). The 
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BamHI/AIwNI cDNA fragment was then inserted into the vaccinia pgsATA-18 vector 
that had been linearized by EcdR i and Hind III cleavage and of which the cohesive ends 
had been filled with Klenow DNA Polymerase (Boehringer). The resulting plasmid was 
named pvHCV-41 and encoded the E2 region from amino acids Met347 to GIn673. 
5 including 37 amino acids (from Met347 to Giy383) of the E1 protein that can serve as 
signal sequence. The same HCV cDNA was inserted into the EcoR i and Bbr Pl-cat 
vector pMS66, that had subsequently been blunt ended with Klenow DNA Polymerase. 
The resulting plasmid was named pvHCV-42 and also encoded amir • acids 347 to 633. 
The NcoI/AlwNI fragment was inserted in a similar way into the same sites of pgsATA- 
10 18 (pvHCV-43) or pMS-SS vaccinia vectors (pvHCV-44). pvHCV-43 and pvHCV-44 
encoded amino acids 364 to 673 of the HCV poiyprotein, of which amino acids 364 to 
383 were derived from the natural carboxyterminai region of the E1 protein encoding 
the signal sequence for E2, and amino acids 38- to 573 of the mature E2 protein. 

1 5 2.5. Generation of recombinant HCV-vaccinis viruses 

Rabbit kidney RK 1 3 cells !ATCC CCL 37), human osteosarcoma 1 43B thymidine 
kinase deficient (TK*> (ATCC CRL 8303), HeLa (ATCC CCL 2). and Hep G2 (ATCC HB 
8065) cell lines were obtained from the American Type Culture Collection (ATCC, 

20 Rockville, Md, USA). The ceils were grown in Dulbecco's modified Eagle medium 
(DMEM) supplemented with 10 % foetal caif serum, and with Earie's salts (EMEM) for 
RK13 and 143 B (TK-), and with glucose (4 g/l) for Hep G2. The vaccinia virus WR 
strain (Western Reserve, ATTC VR119) was routinely propagated in either 143B or 
RK13 cells, as described previously {Panicait St Paoietti, 1982; Piccini et aL, 1987; 

25 Mackett et al., 1982, 1984, and 1986). A confluent monolayer of 143B ceils was 
infected with wild type vaccinia virus at a multiplicity of infection (m.o J.) of 0.1 ( = 0.1 
plaque forming unit (PFU) per cell). Two hours later, the vaccinia recombination plasmid 
was transfected into the infected ceils in the form of a calcium phosphate coprecipitate 
containing 500 ng of the plasmid DNA to allow homologous recombination {Graham & 

30 van der Efa, 1973; Mackett et a!., 1985). Recombinant viruses expressing the 
Escherichia coli xanthine-guanine phosphoribosyl transferase (gpt) protein were selected 
on rabbit kidney RK1 3 cells incubated in selection medium (EMEM containing 25 pg/mt 
mycophenolic acid 0V1PA), 250pg/mi xanthine, and 15pg/mi hypoxanthtne; Falknerand 
Moss, 1 988; Janknecht et al, 1 991 ). Single recombinant viruses were purified on fresh 
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monolayers of RK1 3 cells under a 0.9% agarose overiay in selection medium. Thymidine 
kinase deficient (TK") recombinant viruses were selected and then plaque purified on 
fresh monolayers of human 1438 cells (TK-) in the presence of 25 //g/ml 5-bromo-2'- 
deoxyuridine. Stocks of purified recombinant HCV-vaccinia viruses were prepared by 
5 infecting either human 143 3 or rabbit RK13 ceils at an m.o.i. of 0.05 (Mackett et al, 
1988). The insertion of the HCV cDNA fragment in the recombinant vaccinia viruses 
was confirmed on an aiiquot (50 p\) of the cell Sysate after the MP A selection by means 
of PCR with the primers used to clone the respective HCV fragments (see Table 1 ). The 
recombinant vaccinia-HCV viruses were named according to the vaccinia recombination 
10 piasmid number, e.g. the recombinant vaccinia virus vvHCV-lOA was derived from 
recombining the wild type WR strain with the pvHCV-10A piasmid. 

Example 3: infection of ceils with recombinant vaccinia viruses 

15 A confluent monolayer of RK13 ceils was infected at a m.o.i. of 3 with the 

recombinant HCV-vaccinia viruses as described in example 2 . For infection, the ceil 
monolayer was washed twice with phosphate-buffered saline pH 7.4 (PBS) and the 
recombinant vaccinia virus stock was diluted in MEM medium. Two hundred //! of the 
virus solution was added per 1 0 e cells such that the m.o.i. was 3, and incubated for 45 

20 min at 24 9 C. The virus solution was aspirated and 2 ml of complete growth medium 
(see example 2) was added per 1Q 5 ceiis. The cells were incubated for 24 hr at 37 C C 
during which expression of the HCV proteins took place. 

Example 4: Analysis of recombinant proteins bv means of western blotting 

25 

The infected cells were washed two times with PBS, directly lysed with lysis 
buffer (50 mM Tris.HCI pH 7.5, 150 mM NaCI, 1 % Triton X-100, 5 mM MgCS 2 , 1 //g/ml 
aprotinin (Sigma, Bornem, Belgium)) or detached from the flasks by incubation in 50 mM 
Tris.HCL pH 7.5/ 10 mM EDTA/ 150 mM NaCi for 5 min, and collected by 
30 centrifugation (5 min at 1000g). The ceil pellet was then resuspended in 200 //I lysis 
buffer (50 mM Tris.HCL pH 8.0, 2 mM EDTA, 150 mM NaCI, 5 mM MgCl, aprotinin, 
1% Triton X-100) per 10 5 cells. The ceil lysates were cleared for 5 min at 14,000 
rpm in an Eppendorf centrifuge to remove the insoluble debris. Proteins of 20 y\ lysate 
were separated by means of sodium dodecyi sulphate-polyacrylamide gel electrophoresis 
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(3DS-PAG5L The proteins were then electro-transferred from the ge! to a nitrocellulose 
sr^et (Amersham) using a Hoefer HSi transfer unit cooled to 4°C for 2 hr at 100 V 
constant voltage, in transfer buffer (25 mM Tris.HCi pH 8.0, 192 mM glycine, 20% 
(v/v) methanol). Nitrocellulose filters were blocked with Blotto (5 % (w/v) fat-free 
instant milk powder in PBS; Johnson et aL 1981) and incubated with primary 
antibodies diluted in Blotto/0.1 % Tween 20. Usually, a human negative control serum 
or serum of a patient infected with HCV were 200 times diluted and preincubated for 
1 hour at room temperature with 200 times diluted wild type vaccinia virus-infeited cell 
lysate in order to decrease the non-specific binding. After washing with Biotto/0.1 % 
Tween 20, the nitrocellulose filters were incubated with alkaline phosphatase substrate 
solution diluted in Blotto/0.1 % Tween 20. After washing with 0.1 % Tween 20 in PBS, 
the filters were incubated with alkaiine phosphatase substrate solution (100 mM 
Tris.HCi pK 9.5, 100 mM Nad, 5 mM MgC!,, 0,38 //g/ml nitrobiue tetrazolium, 0.155 
yq/ml 5-bromo-4-chicro-3-indoiy!phosphat9). AH steps, except the efectrotransfer, were 
performed at room temperature. 

Example 5: Purification of recombinant El or E2 protein 
5.1 . Lvsis 

Infected RK13 cells (carrying E1 or E2 constructs) were washed 2 times with 
phosphate-buffered saline (PBS) and detached from the culture recipients by incubation 
in PBS containing 1 0 mM EDTA. The detached cells were washed twice with PBS and 
1 ml of lysis buffer {50 mM Tris.HCi pH 7.5, 150 mM NaCI, 1% Triton X-10O. 5 mM 
MgCI 2 , 1 //g/ml aprotinin {Sigma, Bornem, Belgium) containing 2 mM biotinylated N- 
ethylmaleimide (biotin-NEM) (Sigma) was added per 10- cells at 4'C. This lysate was 
homogenized with a type B douncer and left at room temperature for 0.5 hours. Another 
5 volumes of lysis buffer containing 10 mM N-ethyimaleimide {IMEM, Aidrich, Bornem, 
Belgium) was added to the primary lysate and the mixture was left at room temperature 
for 15 min. Insoluble cell debris was cleared from the solution by centrifugation in a 
Beckman JA-14 rotor at 14,000 rpm (301 00 g at r m j for 1 hour at4*C. 
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5.2. Lsctin Chromatography 

The cleared eel! lysate was loaded at a rate of 1 ml/min on a 0.8 by 1 0 cm Lentil- 
lectin Sepharose 48 column (Pharmacia) that had been equilibrated with 5 column 
volumes of lysis buffer at a rate of Irnl/min. The lentil-lectin column was washed with 
5 to 1 0 column volumes of buffer 1 (0.1M potassium phosphate pH 7.3, 500 mM KC\, 
5% glycerol, 1 mM 6-NH,-hexanoic acid, 1 mM MgCl 2 , and 1% DecyiPEG (KWANT, 
Bedum, The Netherlands). In some experiments, the column was subsequently washed 
with 10 column volumes of buffer 1 containing 0.5% Empigen-BB (Calbicchem, San 
Diego, CA, USA) instead of 1 % DecyiPEG. The bound material was eluted by applying 
elution buffer (10 mM potassium phosphate pH 7.3, 5% glycerol, 1 mM hexancic acid, 
ImM MgCU, 0.5% Empigen-BB, and 0.5 M a-methy!-mannopyranoside). The eluted 
material was fractionated and fractions were screened for the presence of El or E2 
protein by means of ELISA as described in example 5. Figure 22 shows ELISA results 
obtained from ientii lectin eluate fractions of 4 different E1 purifications of cell iysates 
infected with vvHCV39 (type lb), vvHCV40 (type lb), vvHCV62 (type 3a), and 
WHCVS3 (type 5a). Figure 23 shows the profiles obtained from the values shown in 
Figure 22. These results show that the lectin affinity column can be employed for 
envelope proteins of the different types of HCV. 

5.3. Concentration and partial reduction 

The El - or E2-positive fractions were pooled and concentrated on a Centricon 
30 kDa (Amicon) by centrifugation for 3 hours at 5,000 rpm in a Beckman JA-20 rotor 
at 4*C. In some experiments the El- or E2-positive fractions were pooled and 
concentrated by nitrogen evaporation. An equivalent of 3.10 s cells was concentrated 
to approximately 200 //I. For partial reduction, 30% Empigen-BB (Calbiochem, San 
Diego, CA, USA) was added to this 200 //I to a final concentration of 3.5 %, and 1M 
DTT in H 2 0 was subsequently added to a final concentration of 1.5 to 7.5 mM and 
incubated for 30 min at 37 *C. NEM (1M in dimethyisuiphoxide) was subsequently 
added to a final concentration of 50 mM and left to react for another 30 min at 37 "C 
to block the free sulphydryl groups. 
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5.^. Gel fiitrarion chromatography 

A Superdex-200 KR 10/20 column [Pharmacia) was equilibrated with 3 column 
volumes PBS/3% EmpigervBE. Tne reduced mixture was injected in a 500 //I sample 
5 loop of the Smart System (Pharmacia) and PBS/3% Empigen-BS buffer was added for 
gelfiltration. Fractions of 250 jji were collected from V c to v.. The fractions were 
screened for the presence of El or E2 protein as described in example 6. 

Figure 24 shows ELiSA results obtained from fractions obtained after gelfiltration 
chromatography of 4 different El purifications of eel! lysates infected with vvHCV39 

10 (type lb), wHCV40 (type lb), wKCV62 (type 3a!, and vvHCV53 {type 5a). Figure 25 
shows the profiles obtained from purifications of El proteins of types lb. 3a, and 5a 
{from RK13 ceils infected with vvrCV33, vvHCV52, and vvHCVS3, respectively; 
purified on lentil lectin and reduced as in the previous examples). The peaks indicated 
with 'V, '2' , and '3', represent pure El protein peaks (El reactivity mainly in fractions 

15 2S to 30). These peaks show very similar molecular weights of approximately 70 kDa, 
corresponding to dimeric El protein. Other peaks in the three profiles represent vaccinia 
virus and/or cellular proteins which couid be separated from E1 only because of the 
reduction step as outlined in example 5.3. and because of the subsequent gelfiltration 
step in the presence of the proper detergent. As shown in Figure 25 pool 1 

20 (representing fractions 10 to 17) and pool 2 (representing fractions 18 to 25) contain 
contaminating proteins net present in the El pool (fractions 26 to 30). The El pe3k 
fractions were ran on SDS/PAGE and blotted as described in example 4. Proteins 
labelled with NEM-biotin were detected by streptavidin-alkaline phosphatase as shown 
in Figure 27. It can be readily observed that, amongst others, the 29 kDa and 45kDa 

25 contaminating proteins present before the gelfiltration chromatography {lane 1 ) are only 
present at very low levels in the fractions 25 to 30. The band at approximately 55kDa 
represents the El dimeric form that could not be entirely disrupted into the monomeric 
El form. Similar results were obtained for the type 3a El protein (lanes 10 to 15). 
which shows a faster mobility on SDS/PAGE because of the presence of only 5 

30 carbohydrates instead of 5. Figure 23 shows a silver stain of an SDS/PAGE ge! run in 
identical conditions as in Figure 26. A complete overview of the purification procedure 
is given in Figure 29. 

The presence of purified El protein was further confirmed by means of western 
blotting as described in example 4. The dimeric El protein appeared to be non- 
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aggregated and free of contaminants. The subtype 1b El protein purified from 
wHCV40-infected cells according to the above scheme was aminoterminaily sequenced 
on an 477 Perkins-Elmer sequencer and appeared to contain a tyrosine as first residue. 
This confirmed that the El protein had been cleaved by the signal peptidase at the 
correct position (between A 191 and Y192) from its signal sequence. This confirms the 
finding of Hijikata et al. {1 991) that the aminoterminus of the mature E1 protein starts 
at amino acid position 192. 

5.5. Purification of the E2 protein 

The E2 protein (amino acids 384 to 673) was purified from RK1 3 ceils infected 
with vvKCV44 as indicated in Examples 5.1 to 5.4. Figure 30 shows the OC 2S , profile 
[continuous line) of the lentil lectin chromatography. The dotted line represents the E2 
reactivity as detected by ELISA (see example 5). Figure 31 shows the same profiles 
obtained from gelfiltration chromatography of the ientil-lectin E2 pool (see Figure 30), 
part of which was reduced and blocked according to the methods as set out in example 
5.3., and part of which was immediately applied to the column. Both parts of the E2 
pool were run on separate gelfiltration columns. It could be demonstrated that E2 forms 
covalently-linked aggregates with contaminating proteins if no reduction has been 
performed. After reduction and blocking, the majority of contaminating proteins 
segregated into the V 0 fraction. Other contaminating proteins copurified with the E2 
protein, were not covalently linked to the E2 protein any more because these 
contaminants could be removed in a subsequent step. Figure 32 shows an additional 
Ni 2 *-iMAC purification step carried out for the E2 protein purification. This affinity 
purification step employs the 6 histidine residues added to the E2 protein as expressed 
from vvHCV44. Contaminating proteins either run through the column or can be 
removed by a 30 mM imidazole wash. Figure 33 shows a silver-stained SDS/PAGE of 
0.5 pg of purified E2 protein and a 30 mM imidazole wash. The pure E2 protein could 
be easily recovered by a 200 mM imidazole eiution step. Figure 34 shows an additional 
desalting step intended to remove imidazole and to be able to switch to the desired 
buffer, e.g. PBS, carbonate buffer, saline. 

Starting from about 50,000 cm : of RK13 cells infected with vvHCV11A (or 
WHCV40) for the production of E1 or vvHCV41 , WHCV42, wHCV43, or vvHCV44 for 
production of E2 protein, the procedures described in examples 5.1 to 5.5 allow the 
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purification of approximately 1.3 nng of El protein and 0.5 mg of 52 protein. 

it should aiso be remarked that secreted E2 protein (constituting approximately 
30-40%, 60-70% being in the intracellular form) is chracterized by aggregate formation 
(contrary to expectations). The same problem is thus posed to purify secreted E2. The 
5 secreted E2 can be purified as disclosed above. 



Example 6: EL1SA for the detection of ami-El or anti-E2 antibodies or for the detection 
of El or E2 proteins 

10 

Maxisorb microweii piates (Nunc, Rcskilde, Denmark) were coated with 1 volume 
(e.g. 50 fj\ or 1 00 //I or 2C0 //I) per well of a 5 fjqlrrd solution of Streptavicin (Boehringer 
Mannheim) in PBS for 1 6 hours at 4*C or for 1 hour at 37 9 C. Alternatively, the wells 
were coated with 1 volume of 5 jjQtrnl of Galanthus nivalis agglutinin (GNA) in 50 mM 

15 sodium carbonate buffer pH 9.6 for 15 hours at 4*C or for 1 hour at 37 9 C. in the case 
of coating with GNA, the plates were washed 2 times with 400 //f of Washing Solution 
of the innotest HCV Ab III kit (Innogenetics, Zwijndrecht, Belgium). Unbound coating 
surfaces were blocked with 1.5 to 2 volumes of blocking solution {0.1 % casein and 
0:1% NaN 3 in PBS) for 1 hour at 37 *C or for 16 hours at 4*C. Blocking solution was 

20 aspirated. Purified E1 or E2 was diluted to 100-1000 ng/ml (concentration measured at 
A = 280 nm) or column fractions to be screened for El or E2 (see example 5), or E1 
or E2 in non-purified cell lysates (example 5.1.) were diluted 20 times in blocking 
solution, and 1 volume of the E1 or E2 solution was added to each well and incubated 
for 1 hour at 37 # C on the Streptavidin- or GNA-coated plates. The rnicrowells were 

25 washed 3 times with 1 volume of Washing Solution of the Innotest HCV Ab III kit 
(innogenetics, Zwijndrecht, Belgium). Serum samples were diluted 20 times or 
monoclonal anti-Si or anti-E2 antibodies were diluted to a concentration of 20 ng/mi in 
Sample Diluent of the Innotest HCV Ab 111 kit and 1 volume of the solution was left to 
react with the E1 or E2 protein for 1 hour at 37 *C. The rnicrowells were washed 5 

30 times with 400 }j\ of Washing Solution of the Innotest HCV Ab III kit (Innogenetics, 
Zwijndrecht, Belgium). The bound antibodies were detected by incubating each well for 
1 hour at 37 *C with a goat anti-human or anti-mouse IgG, percxidase-conjugated 
secondary antibody (DAKO, Gtostrup. Denmark) diluted 1/80,000 in 1 volume of 
Conjugate Diluent of the Innotest HCV Ab II! kit (Innogenetics, Zwijndrecht, Belgium), 
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and coior development was obtained by addition of substrate of the Innotes: HCV Ab 
III kit (Innogenetics. Zwijndrecht. Belgium, diluted TOO times in 1 volume of Substrate 
Solution of the Innotest HCV Ab III kit (Innogenetics. Zwijndrecht. Belgium] for 30 min 
at 24 *C after washing of the plates 3 times with 400 p\ of Washing Solution of the 
Innotest HCV Ab III kit (Innogenetics, Zwijndrecht, Belgium). 



Example 7: Follow up of patient groups with different clinical profiles 
7.1. Monitoring of anti-El and anti-=2 antibodies 

The current hepatitis C virus (HCV! diagnostic assays have been developed for 
screening and confirmation of the presence of HCV antibodies. Such assays do net 
seem to provide information usefui for monitoring of treatment or for prognosis of the 
outcome of disease. However, as is the case for hesatitis B, detection and quantification 
of anti-envelope antibodies may prove more useful in a clinical setting. To investigate 
the possibility of the use of anti-£1 antibody titer and anti-E2 antibody titer as 
prognostic markers for outcome of hepatitis C disease, a series of IFN-a treated patients 
with long-term sustained response (defined as patients with normal transaminase levels 
and negative HCV-RNA test (PCR in the 5'. non-coding region) in the blood for a period 
of at least 1 year after treatment) was compared with patients showing no resoonse or 
showing biochemical response with relapse at the end of treatment.. 

A group of 8 IFN-a treated patients with long-term sustained response (LTR, 
follow up 1 to 3.5 years, 3 type 3a and 5 type 1b) was compared with 9 patients 
showing non-complete responses to treatment (NR. follow up 1 to 4 years, 6 type 1b 
and 3 type 3a). Type 1b (vvHCV-39, see example 2.5.) and 3a E1 (wHCV-62, see 
example 2.5.) proteins were expressed by the vaccinia virus system (see examples 3 
and 4) and purified to homogeneity (example 5). The samples derived from patients 
infected with a type 1 b hepatitis C virus were tested for reactivity with purified type 1 b 
E1 protein, while samples of a type 3a infection were tested for reactivity of anti-type 
3a El antibodies in an EUSA as desribed in example 6. The genotypes of hepatitis C 
viruses infecting the different patients were determined by means of the Inno-LiPA 
genotyping assay (Innogenetics. Zwijndrecht. Belgium). Figure 5 shows the anti-El 
signal-to-noise ratios of these patients followed during the course of interferon 
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treatment and during the follow-up period after treatment. LTR cases consistently 
showed rapidly declining anti-El levels (with complete negativation in 3 cases), while 
anti-El levels of NR cases remained approximately constant. Some of the obtained anti- 
El data are shown in Table 2 as average S/N ratios ± SO (mean anti-El titer). The anti- 



Already at the end of treatment, marked differences could be observed between 
the 2 groups. Anti-El antibody titers had decreased 6.9 times in LTR but only 1.5 times 
in NR. At the end of follow up, the anti-El titers had declined by a factor of 22.5 in the 
patients with sustained response and even slightly increased in NR. Therefore, based 
on these data, decrease of anti-El antibody levels during monitoring of IFN-or therapy 
correlates with long-term, sustained response to treatment. The anti-El assay may be 
very useful for prognosis of long-term resconse to !FN treatment, or to treatment of the 
hepatitis C disease in general. 

This finding was not expected. On the contrary, the inventors had expected the 
anti-El antibody levels to increase during the course of IFN treatment in patients with 
long term response. As is the case for hepatitis 3, the virus is cleared as a consequence 
of the seroconversion for anti-H3sAg antibodies. Also in many other virus infections, the 
virus is eliminated when anti-envelope antibodies are raised. However, in the 
experiments of the present invention, anti-El antibodies clearly decreased in patients 
with a long-term response to treatment, while the antibody-level remained approximately 
at the same ievel in non-responding patients. Although the outcom.e of these 
experiments was not expected, this non-obvious finding may be very important and 
useful for clinical diagnosis of HCV infections. As shown in Figures 9, 10, 1 1, and 1 2, 
anti-E2 levels behaved very differently in the same patients studied and no obvious 
decline in titers was observed as for anti-El antibodies. Figure 35 gives a complete 
overview of the pilot study. 

As can be deduced from Table 2, the anti-El titers were on average at least 2 
times higher at the start of treatment in long term responders compared with incomplete 
responders to treatment. Therefore, measuring the titer of anti-El antibodies at the start 
of treatment, or monitoring the patient during the course of infection and measuring the 
anti-El titer, may become a useful marker for ciinical diagnosis of hepatitis C. 
Furthermore, the use of more defined regions of the El or E2 proteins may become 
desirable, as shown in example 7.3. 



5 



El titer could be deduced from the signal to noise ratio as show in Figures 5, 6, 7, and 
8. 
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7.2. Analysis of £1 a nd =2 antibodies m a larger patient cohnrr 

* ■ 

The pilot study lead the inventors to conclude that, in case infection was 
completely cleared, antibodies to the HCV envelope proteins changed more rapidly than 
antibodies to the more conventionally studied HCV antigens, with El antibodies 
changing most vigorously. We therefore included more type 1 b and 3a-infected LTR and 
further supplemented the cohort with a matched series of NR. such that both groups 
included 14 patients each. Some partial responders (PR) and responders with relapse 
(RR) were also analyzed. 

Figure 36 depicts average E1 antibody (El Abj and E2 antibody (E2Ab) levels in 
the LTR and MR groups and Tables 4 and 5 show the statistical analyses. In this larger 
cohort, higher E1 antibody levels before IFN-c therapy were associated with LTR (P < 
0.03). Since much higher E1 antibody levels were observed in type 3a-infected patients 
compared with type lb-infected patients (Figure 37), the genotype was taken into 
account (Table 4). Within the type lb-infected group, LTR also had higher El antibody 
levels than NR at the initiation of treatment [P < 0.05]; the limited number of type 3a- 
infected NR did not allow statistical analysis. 

Of antibody levels monitored in LTR during the 1 .5-year follow up period, only 
E1 antibodies cleared rapidly compared with levels measured at initiation of treatment 
(P = 0.0058, end of therapy; P = 0.0047 and ? = 0.0051 at 6 and 12 months after 
therapy, respectively]. This clearance remained significant within type 1- or type 3- 
infected LTR (average P values < 0.05). These data confirmed the initial finding that 
El Ab levels decrease rapidly in the early phase of resoivement. This feature seems to 
be independent of virai genotype. In NR, PR, or RR, no changes in any of the antibodies 
measured were observed throughout the follow up period. In patients who responded 
favourably to treatment with normalization of ALT levels and HCV-RNA negative during 
treatment, there was a marked difference between sustained responders (LTR) and 
responders with a relapse (RR). In contrast to LTR, RR did not show any decreasing El 
antibody levels, indicating the presence of occult HCV infection that could neither be 
demonstrated by PCR or other classical techniques for detection of HCV-RNA, nor by 
raised ALT levels. The minute quantities of viral RNA, still present in the RR group 
during treatment, seemed to be capable of anti-El B cell stimulation. Anti-El monitoring 
may therefore not only be able to discriminate LTR from NR. but also from RR. 



WO 96/04385 ^ r PCT/EP9S/0303I 

54 

7.3. Monitoring of antibodies of defined regions of the El protein 

Although the molecular biological approach of identifying HCV antigens resulted 
in unprecedented breakthrough in the development of virai diagnostics, the method of 
5 immune screening of /igt! 1 libraries predominantly yielded linear epitopes dispersed 
throughout the core and non-structural regions, and analysis of the envelope regions had 
to await cloning and expression of the E1/E2 region in mammaiian ceils. This approach 
sharply contrasts with many other viral infections of which epitopes to the envelope 
regions had already been mapped long before the deciphering of the genomic structure. 

10 Such epitopes and corresponding antibodies often had neutralizing activity useful for 
vaccine development and/or allowed the development of diagnostic assays with clinical 
or prognostic significance (e.g. antibodies to hepatitis 5 surface antigen). 
As no HCV vaccines or tests allowing clinical diagnosis and prognosis of hepatitis C 
disease are available today, the characterization of viral envelope regions exposed to 

1 5 immune surveillance may significantly contribute to new directions in HCV diagnosis and 
prophylaxis. 

Several 20-mer peptides (Table 3) that overlapped each other by 8 amino acids, 
were synthesized according to a previously described method (EP-A-0 489 9S8) based 
on the HC-J1 sequence (Okamoto et al., 1990). None of these, except peptide env35 

20 (also referred to as E1-35), was able to detect antibodies in sera of approximately 200 
HCV cases. Only 2 sera reacted slightly with the env35 peptide. However, by means 
of the anti-E1 ELISA as described in example 6, it was possible to discover additional 
epitopes as follows: The anti-El ELISA as described in example 6 was modified by 
mixing 50 //g/mi of El peptide with the 1/20 diluted human serum in sample diluent. 

25 Figure 13 shows the results of reactivity of human sera to the recombinant El 
(expressed from vvHCV-40) protein, in the presence of single or of a mixture of E1 
peptides. While only 2% of the sera could be detected by means of El peptides coated 
on strips in a Line Immunoassay format, over half of the sera contained anti-El 
antibodies which could be competed by means of the same peptides, when tested on 

30 the recombinant E1 protein. Some of the murine monoclonal antibodies obtained from 
Balb/C mice after injection with purified El protein were subsequently competed for 
reactivity to E1 with the single peptides (Figure 14). Clearly, the region of env53 
contained the predominant epitope, as the addition of env53 could substantially 
compete reactivity of several sera with E1 , and antibodies to the env3 1 region were also 
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detected. This finding was surprising, since the env33 and env31 peptides had not 
shown any reactivity when coaxed directly to the solid phase. 

Therefore peptides were synthesized using technology described by applicant 
previously (in WO 93/18054). The following peptides were synthesized: 

peptide env35A-biotin 

NH 2 -SNSSEAADM[MKTPGCV-GKbiotin (SEQ ID NO 51) 
spanning amino acids 208 to 227 of the HCV polyprotein in the El region 
peptide biotin-env53 ('epitope A'} 

biotin-GG-ITGHRMAWDMMMNWS?TTAL-COOH (SEQ ID NO 52) 
spanning amino acids to 313 of 332 of the HCV polyprotein in the El 
region 

peptide 1bE1 ('epitope B'i 

HjN-YEVRNVSGiYHVTNDCSNSSlVYEAADiVIIMhTFGCGK -biotin (SEQID 
NO 53) 

spanning amino acids 1 92 to 228 of the HCV polyprotein in the E1 region 
and compared with the reactivities of peptides Ela-BE (biotin-GG- 
TPTVATRDGKLPATQLRRKIDLL, SEQ ID NO 5±) and Elb-BB (biotin-GG- 
TPTLAARDASVPTTTIRRHVDLL, SEQ ID NO 55) which are derived from the same region 
of sequences of genotype 1 a and 1 b respectively and which have been described at the 
IXth international virology meeting in Glasgow, 1 3S3 ('epitope C). Reactivity of a panel 
of HCV sera was tested on epitopes A, B and C and epitope B was also compared with 
env35A (of 47 HCV-positive sera, 8 were positive on epitope B and none reacted with 
env35A). Reactivity towards epitopes A, B, and C was tested directly to the biotinylated 
peptides (50 ^g/ml) bound to streptavicin-coated plates as described in example 6. 
Clearly, epitopes A and B were most reactive while epitopes C and env35A-biotin were 
much less reactive. The same series of patients that had been monitored for their 
reactivity towards the complete El protein (example~7.1 .) was tested for reactivity 
towards epitopes A, B, and C. Little reactivity was seen to epitope C. while as shown 
in Figures 15, 16, 17, and 18, epitopes A and B reacted with the majority of sera. 
However, antibodies to the most reactive epitope (epitope A) did not seem to predict 
remission of disease, while the anti-1bE1 antibodies (epitope B) were present almost 
exclusively in long term responders at the start of IFN treatment. Therefore, anti-1 bE 1 
(epitope B) antibodies and anti-env53 (epitope A) antibodies could be shown to be 
useful markers for prognosis of hepatitis C disease. The env53 epitope may be 
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advantageously used for the detection of cross-reactive antibodies (antibodies that 
cross-react between major genbtypes) and antibodies to the env53 region may be very 
useful for universal El antigen detection in serum or liver tissue. Monoclonal antibodies 
that recognized the env53 region were reacted with a random epitope library. In 4 
5 clones that reacted upon immunoscreening with the monoclonal antibody 5E1 A1 0. the 
sequence -G WD- was present. Because of its analogy with the universal HCV sequence 
present in all HCV variants in the env53 region, the sequence A WD is thought to 
contain the essential sequence of the env53 cross-reactive murine epitope. The env31 
dearly also contains a variable region which may contain an epitope in the amino 

10 terminal sequence -YQVRNSTGL- (SEQ ID NO 931 and may be useful for diagnosis. 
Env31 or El -3 1 as shown in Table 3, is a par: of the peptide 1 bE 1 . Peptides 51-33 and 
El -51 also reacted to some extent with the murine antibodies, and peptide El -55 
(containing the variable region 6 (VS); spanning amino acid posirions 329-335) also 
reacted with some of the patient sera. 

1 5 Anti-E2 antibodies clearly followed a different pattern than the anti-El antibodies, 

especially in patients with a long-term response to treatment. Therefore, it is clear that 
the decrease in anti-envelope antibodies could not be measured as efficiently with an 
assay employing a recombinant E1 /E2 protein as with a single anti-El or anti-E2 protein. 
The anti-E2 response would clearly blur the anti-El response in an assay measuring both 

20 kinds of antibodies at the same time. Therefore, the ability to test anti-envelope 
antibodies to the single E1 and E2 proteins, was shown to be useful. 

7.4. Mapping of anti-E2 antibodies 

25 Of the 24 anti-E2 Mabs only three could be competed for reactivity to 

recombinant E2 by peptides, two of which reacted with the HVRI region (peptides E2- 
67 and E2-69, designated as epitope A) and one which recognized an epitope competed 
by peptide E2-13B (epitope O. The majority of murine antibodies recognized 
conformational anti-E2 epitopes (Figure 1 9). A human response to HVRI (epitope A), and 

30 to a lesser extent HVRII (epitope B) and a third linear epitope region (competed by 
peptides E2-23, E2-25 or E2-27, designated epitope E) and a fourth linear epitope region 
(competed by peptide E2-17B, epitope D) could also frequently be observed, but the 
* majority of sera reacted with conformational epitopes (Figure 20). These conformational 
epitopes could be grouped according to their relative positions as follows: the IgG 
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antibodies in the supernatant of hybridomas 15C8C1. 1 2D 11 FT. 9G3E6, 8G10D1H9. 
10D3C4, 4H6B2, 17F2C2, 5H6A7, 1 5B7A2 recognizing conformational epitopes were 
purified by means of protein A affinity chromatography and 1 mg;mi of the resulting 
IgG's were biotinyfated in borate buffer in the presence of biotin. Biotinyiated antibodies 
were separated from free biotin by means of geifiitration chromatography. Pooled 
biotinyiated antibody fractions were diluted 100 to 10,000 times. E2 protein bound to 
the solid phase was detected by the biotinyiated IgG in the presence of 100 times the 
amount of non-biotinylated competing antibody and subsequently detected by alkaline 
phosphatase labeled streptavidin. 

Percentages of competition are given in Table 6. Based on these results, 4 
conformational anti-E2 epitope regions (epitopes F, G, H and I) could be delineated 
(Figure 38). Alternatively, these Mabs may recognize mutant linear epitopes not 
represented by the peptides used in this study. Mabs 4H6B2 and 10D3C4 competed 
reactivity of 1 6ASE7, but unlike 1 SASE7, they did not recognize peptide E2-1 38. These 
Mabs may recognize variants of the same linear epitope {epitope C) or recognize a 
conformational epitope which is stericaily hindered or changes conformation after 
binding of 16A6E7 to the E2-13B region {epitope H). 



Example 8: El Qlvcosvlation mutants 

8.1. Introduction 

The E1 protein encoded by vvHCV10A, and the E2 protein encoded by vvHCV41 
to 44 expressed from mammaiian cells contain 6 and 11 carbohydrate moieties, 
respectively. This could be shown by incubating the lysate of vvHCVI OA-infected or 
vvHCV44-infected RK1 3 cells with decreasing concentrations of giycosidases (PNGase 
F or Endoglycosidase H, {Boehringer Mannhein Biochemical according to the 
manufacturer's instructions), such that the proteins in the lysate (including ED are 
partially deglycosyiated (Fig. 39 and 40, respectively). 
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Mutants devoid of some of their glycosylanon sites could allow the selection of 
envelope proteins with improved immunological reactivity. For HIV for example, gp 1 20 
proteins lacking certain selected sugar-addition motifs, have been found to be 
particularly useful for diagnostic or vaccine purpose. The addition of a new 
5 oligosaccharide side chain in the hemagglutinin protein of an escape mutant of the 
A/Hong Kong/3/6 8 (H3N2) influenza virus prevents reactivity with a neutralizing 
monoclonal antibody {Skene! et al, 1984). When novel glycosylation sites were 
introduced into the influenza hemagiutinin protein by site-specific mutagenesis, dramatic 
antigenic changes were observed, suggesting that the carbohydrates serve as a 

10 modulator of antigenicity {Gallagher et aL 1983). In another analysis, the 8 
carbohydrate-addition motifs of the surface protein gp7G of the Friend Murine Leukemia 
Virus were deleted. Although seven of the mutations did not affect virus infectivity, 
mutation of the fourth glycosylation signal with respect to the amino terminus resulted 
in a non-infectious phenotype (Kayman et al., 1 991 ). Furthermore, it is known in the art 

15 that addition of N-iinked carbohydrate chains is important for stabilization of folding 
intermediates and thus for efficient folding, prevention of malfoiding and degradation 
in the endoplasmic reticulum, oIigomeriz3tion, biological activity, and transport of 
glycoproteins (see reviews by Rose et al., 1988; Doms et a!., 1993; Helenius, 1994). 
After alignment of the different envelope protein sequences of HCV genotypes, 

20 it may be inferred that not all 6 giycosyiaticn sites on the HCV subtype 1b E1 protein 
are required for proper folding and reactivity, since some are absent in certain 
(sub)types. The fourth carbohydrate motif (on Asn251), present in types 1b, 6a, 7, 8, 
and 9, is absent in all other types know today. This sugar-addition motif may be 
mutated to yield a type 1 b El protein with improved reactivity. Also the type 2b 

25 sequences show an extra giycosyiation site in the V5 region (on Asn299). The isolate 
S33, belonging to genotype 2c, even lacks the first carbohydrate motif in the VI region 
(on Asn), while it is present on all other isolates (Stuyver et al., 1 994) However, even 
among the completely conserved sugar-addition motifs, the presence of the 
carbohydrate may not be required for folding, but may have a role in evasion of immune 

30 surveillance. Therefore, identification of the carbohydrate addition motifs which are not 
required for proper folding (and reactivity! is not obvious, and each mutant has to be 
analyzed and tested for reactivity. Mutagenesis of a glycosylation motif (NXS or NXT 
sequences) can be achieved by either mutating the codons for N, S, or T, in such a way 
that these codons encode amino acids different from N in the case of NL and/or ammo 
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acids different from S or 7 in the case of 3 and in the case of T. Alternatively, the X 
position may be mutated into Pj- since it is known that MPS or NPT are not frequently 
modified with carbohydrates. After establishing which carbohydrate-addition motifs are 
required for folding and/or reactivity and which are not, combinations of such mutations 
5 may be mads. 

8.2. Mutagenesis of the El protein 

Ail mutations were performed on the El sequence of clone HCCI10A (SEQ ID 

10 NO. 5). The first round of PCH was performed using sense primer 'QPT {see "able 7} 
targeting the GP7 sequence Iccated upstream of the vaccinia 1 1 K late promoter, and 
an antisense primer (designated GLY*, with # representing the number of the 
giycosylation site, see Fig. 41) containing the desired base change to obtain the 
mutagenesis. The six GLY£ primers {each specific for a given giycosylation site) were 

15 designed such that: 

- Modification of the codon encoding for the N-glycosyiated Asn (AAC or AAT) to a Gin 
codon (CAA or CAG). Giutamine was chosen because it is very similar to asparagine 
(both amino acids are neutral and contain non-polar residues, giutamine has a longer 
side chain (one more -CH 2 - group). 

20 - The introduction of silent mutations in one or several of the codons downstream of the 
giycosylation site, in order to create a new unique or rare (e.g. a second Snnal site for 
E1Giy5) restriction enzyme sits. Without modifying the amino acid sequence, this 
mutation will provide a way to distinguish the mutated sequences from the original E1 
sequence (pvHCV-10A) or from each other (Figure 41) .This additional restriction site 

25 may also be useful for the construction of new hybrid (double, triple/etc.) giycosylation 
mutants. 

- 1 8 nucleotides extend 5' of the first mismatched nucleotide and 1 2 to 1 6 nucleotides 
extend to the 3' end. Table 7 depicts the sequences of the six GLY# primers overlapping 
the sequence of N-linked giycosylation sites. 
30 For site-directed mutagenesis, the 'mispriming' or 'overlap extension' (Horton. 

1993) was used. The concept is illustrated in Figures 42 and 43. First, two separate 
fragments were amplified from the target gene for each mutated site. The PCR product 
obtained from the 5' end {product GLY£) was amplified with the 5' sense GPT primer 
.(see Table 7) and with the respective 3' antisense GlY* primers. The second fragment 
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(product OVR#) was amplified with the 3' antisense 7K B pnmer and the respective 5" 



The OVR# primers target part of the GLY* primer sequence. Therefore, the two 
groups of PCR products share an overlap region of identical sequence. When these 
intermediate products are mixed (GLY-1 with OVR-1, GlY-2 with OVR-2, etc.). melted 
at high temperature, and reannealed, the top sense strand of product GLY# can anneal 
to the antisense strand of product OVR# (and vice versa) in such a way that the two 
strands act as primers for one another (see Fig. 42.3.). Extension of the annealed 
overlap by Taq polymerase during two PCR cycles created the full-length mutant 
moiecule E1Gly#. which carries the mutation destroying the glycosylarion site number 
#. Sufficient quantities of the E1GIY;? products for cloning were generated in a third 
PCR by means of a common set of two internal nested primers. These two new primers 
are respectively overlapping the 3' end of the vaccinia 1 1 K promoter (sense GPT-2 
primer) and the 5' end of the vaccinia thymidine kinase locus (antisense TK R -2 primer, 
see Table 7). All PCR conditions were performed as described in Stuyver et al. (1 993). 

Each of these PCR products was cloned by EcoRI/BamHI cleavage into the 
EcoRI/BamHl-cut vaccinia vector containing the original El sequence (pvh'CV-1 OA). 

The selected clones were analyzed for length of insert by EcoRI/3amH I cleavage 
and for the presence of each new restriction site. The sequences overlapping the 
mutated sites were confirmed by double-stranded sequencing. 

8.3. Analysis of El Qlvcosvlation mutants 

Starting from the 6 plasmids containing the mutant E1 sequences as described 
in example 8.2. recombinant vaccinia viruses were generated by recombination with wt 
vaccinia virus as described in example 2.5. Briefly, 175 cm 3 -flasks of subconfluent 
RK1 3 cells were infected with the 6 recombinant vaccinia viruses carrying the mutant 
E1 sequences, as well as with the vvHCV-1 OA (carrying the non-mutated El sequence) 
and wt vaccinia viruses. Cells were lysed after 24 hours of infection and analyzed on 
western blot as described in example 4 (see Figure 44A). All mutants showed a faster 
mobility (corresponding to a smaller molecular weight of approximately 2 to 3 kDa) on 
SDS-PAGE than the original El protein; confirming that one carbohydrate moiety was 
not added. Recombinant viruses were also analyzed by PCR and restriction enzyme 
analysis to confirm the identity of the different mutants. Figure 44B shows that all 



sense primers (OVR^ primers, 'see Table 7. Figure 43). 
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mutants (as shown in Figure 41) contained the expected additional restriction sites. 
Another part of the cell iysate was used to test the reactivity of the different mutant by 
ELISA. The Iysates were diluted 20 times and added to microwefl plates coated with the 
lectin GNA as described in example 5. Captured (mutant) El glycoproteins were left to 
5 react with 20-times diluted sera of 24 HCV-infected patients as described in example 
6. Signal to noise (S/N) values {OD of GLY£/OD of wt) for the six mutants and E1 are 
shown in Table S. The tabie also shows the ratios between S/N values of GLY# and E1 
proteins. It should be understood that the approach to use ceil Iysates of the different 
mutants for comparison of reactivity with patient sera may result in observations that 

10 are the consequence of different expression levels rather then reactivity levels. Such 
difficulties can be overcome by purification of the different mutants as described in 
example 5, and by testing identical quantities of ail the different El proteins. However, 
the results shown in table 5 already indicate that removal of the 1st (GLY1), 3rd 
(GLY3), and 6th (GLY6) giycosyiation motifs reduces reactivity of some sera, while 

15 removal of the 2nd and 5th site does not. Removal of GLY4 seems to improve the 
reactivity of certain sera. These data indicate that different patients react differentiy to 
the giycosyiation mutants of the present invention. Thus, such mutant El proteins may 
be useful for the diagnosis (screening, confirmation, prognosis, etc.) and prevention of 
HCV disease. 

20 

Example 9: Expression of HCV E2 protein in givcosvlation-deficient veasts 

The E2 sequence corresponding to done HCCL41 was provided with the a- 
25 mating factor pre/pro signal sequence, inserted in a yeast expression vector and 

cerevisiae cells transformed with this construct secreted E2 protein into the growth 
medium. It was observed that most giycosyiation sites were modified with high- 
mannose type glycosylations upon expression of such a construct in S. cerevisiae 
strains (Figure 45). This resulted in a too high level of heterogeneity and in shielding of 
30 reactivity, which is not desirable for either vaccine or diagnostic purposes. To overcome 
this problem, S. cerevisiae mutants with modified giycosyiation pathways were 
generated by means of selection of vanadate-resistant clones. Such clones were 
analyzed for modified giycosyiation pathways by analysis of the molecular weight and 
heterogeneity of the glycoprotein invertase. This allowed us to identify different 
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giycosyiation deficient S. cerevisiae mutants. The E2 protein was subsequenriy 
expressed in some of the selected mutants and left to react with a monoclonal antibody 
as described in example 7, on western blot as described in example 4 (Figure 45). 



Example 10. General utility 

The present results show that not only a good expression system but also a good 
purification protocol are required to reach a high reactivity of the HCV envelope proteins 
with human patient sera. This can be obtained using the proper HCV envelope protein 
expression system and/or purification protocols of the present invention which 
guarantee the conservation of the natural folding of the protein and the purification 
protocols of the present invention which guarantee the elimination of contaminating 
proteins and which preserve the conformation, and thus the reactivity of the HCV 
envelope proteins. The amounts of purified HCV envelope protein needed for diagnostic 
screening assays are in the range of grams per year. For vaccine purposes, even higher 
amounts of envelope protein would be needed. Therefore, the vaccinia virus system 
may be used for selecting the best expression constructs and for limited upscaling, and 
large-scale expression and purification of single or specific oligomeric envelope proteins 
containing high-mannose carbohydrates may be achieved when expressed from several 
yeast strains. In the case of hepatitis B for example, manufacturing of HBsAg from 
mammalian cells was much more costly compared with yeast-derived hepatitis B 
vaccines. 

The purification method disicosed in the present invention may also be used for 
'viral envelope proteins' in general. Examples are those derived from Flaviviruses, the 
newly discovered GB-A, GB-B and GB-C Hepatitis viruses, Pestiviruses (such as Bovine 
viral Diarrhoea Virus (BVDV), Hog Cholera Virus (HCV), Border Disease Virus (BDV)), but 
also less related virusses such as Hepatitis B Virus (mainly for the purification of 
HBsAg). 

Tne envelope protein purification method of the present invention may be used 
for intra- as well as extracellulariy expressed proteins in lower or higher eukaryotic cells 
or in prokaryotes as set out in the detailed description section. 
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Table 1: Recombinant vsccmta piasmids and viruses 



Plasmid name 


Name 


* • 

cDNA subclone 
construction 


Length (nt/aal 


Vector used 
for insertion 


pvHCV-1 3A 


Els 


EcoR I - Hind lit 


472-157 


pgptA7A-18 


pvHCV-12A 


Els 


EcoR i - Hind III 


472/158 


pgptA7A-18 


pvHCV-9A 


E1 


EcoR t - Hind HI 


631/211 


pgptA7A-13 


pvHCV-11A 


Els 


EcoR I - Hind Hi 


625/207 


pgptA7A-13 


pvHCV-17A 


Els 


EcoR I - Hind ill 


625/208 


pgptA7A-18 


pvHCV-TOA 


E1 


EcoR I - Hind Mi 


783/262 


pgptA7A-18 


pvHCV-18A 


COREs 


Acc ! (KI) - EcoR I (KI) 


403/130 


pgp*A7A-18 


pvHCV-34 


CORE 


Acc i (Kit - Fso i 


595/197 


pgprATA- 1 3 


pvHCV-33 


CORE-51 


Acc I (K!) 


1150/380 


pgp:A7A-1 8 


dvHCV-35 


CORE-Elb.his 


EcoR I - BamH i (Ki) 


1032/352 


pMS-65 


pvH' .-36 


CORE-E1n.his 


EcoR I - Nco r (Ki) 


1106/376 


pMS-65 


pvHCV-37 


El A 


Xma I - BamH i 


711/239 


pvHCV-IQA 


pvHCV-38 


E1As 


EcoR I - Bs:E il 


553/183 


pvHCV-1 1A 


pvHCV-39 


El£b 


EcoR i - BamH I 


960/313 


pgsA7A-18 


pvHCV-40 


ElAb.his 


EcoR I - BamH i (KI) 


960/323 


pMS-65 


pvHCV-41 


E2fas 


BamH i (KI)-AIwN i (74) j 


---a 005/331 


pcsA7A-l 8 


pvHCV-42 


E2fas.his 


BamH i (Kl)-AiwN f (74} 


1005/341 


pMS-66 


pvHCV-43 


E2ns 


Nco I (Ki) - AiwN I (74) 


932/314 


pgsA7A-18 


pvHCV-44 


E2ns.his 


Nco I (Ki) - AlwN i (74) 


932/321 


pMS-65 


pvHCV-62 


E1s (type 3a) 


EcoR I - Hind III 


625/207 


pgsA7A-18 


pvHCV-63 


E1s {type 5) 


EcoR i - Hind tii 


* 625/207 


pgsA7A-1 8 


pvHCV-64 


E2 


BamH ! - Hind ill 


1410/463 


pgsA7A-18 


pvHCV-65 


E1-E2 


BamH I - Hind III 


2072/691 


pvHCV-10A 


pvHCV-66 


CORE-E1-E2 


BamH I - Hind HI 


2427/809 


pvHCV-33 



nt: nucleotide aa: aminoacid KI: Kienow DNA Pol filling 74; 74 DNA Pol filling 

Position: aminoacid position in the HCV polyprotein seauence 
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Table 1 - continued: RecombinanT vacri nia olasmids and viruses 



Plasmid 




HCV cDNA subclone 




Vector 


Name 


Name 


Construction 


Length 
(nt/aa) 


used for 
insertion 


ovHCV-81 




CwUni " Dam™ I 




pvMLV- 1 UA 


pvHCV-32 


El *-GLY 2 


EcoR! - BamH I 


783/262 


pvHCV-lOA 


pvHCV-83 


El *-GLY 3 


EcoRl - BamH I 


783/262 


pvHCV-10A 


pvHCV-84 


El *-GLY 4 


EcoRl - BamH I 


783/262 


pvHCV-lOA 


pvHCV-85 


E1 *-GLY 5 


EcoRI - BamH I 


783/252 


pvHCV-lOA 


pvHCV-86 


E1 *-GLY 6 


EcoR! - BamH I 


783/262 


pvHCV-lOA 



nt: nucleotide aa: aminoacid Ki: Kienow DNA Poi filling T4; T4 DNA Pol filling 

Position: aminoacid position in the HCV poiyprotein sequence 
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Table 2 : Summary of anti-El tests 

i. 

SIN +_ SD {mean anti-El titer) 





Start of treatment 


End of treatment 


Follow-up 


LTR 


6.94 +. 2.29 
(1:3946) 


4.48.^2.69 (1:568) 


2.99 ± 2.69 (1:175) 


NR 


5.77 ±_ 3.77 
(1:1607) 


5.29 ± 3.9S 

(1:1060) 


6.08 ± 3.73 

(1:1978) 



LTR : Long-term, sustained response for more than 1 year 
NR : No response, response with relapse, or partial response 
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Table 3 

Synthetic peptides for competition studies 



PROTEIN 

El 



PEPTIDE 


AMINO ACID SEQUENCE 


POSITION 


SEQ ID NO 


E1-31 


LLSCLTVPASAYQVRNSTGL 


1 81 -200 


56 


E1-33 


QVRNSTGLYHVTNDCPNSSI 


193-212 


57 


E1-35 


NDCPNSSIVYEAHDAILHT? 


205-224 


53 


E1-35A 


SNSSIVYEAADMIMHTPGCV 


208-227 


59 


E1-37 


HDAILH7PGCVPCVREGNVS 


217-236 


60 


ET-39 


CVREGNVSRCWVAMTP7VAT 


229-248 


61 


E1-41 


AMTPTVATRDGKLPATQLRR 


?4i s n 

1 "LOU 




El -43 


LPATQLRRHIDLLVGSATLC 


253-272 


63 


E1-45 


LVGSATLCSALYVGDLCGSV 


265-284 


64 


E1-49 


QLP7FSPRRHW7TQGCNCSI 


289-308 


65 


E1-51 


TQGCNCSIYPGKITGHRMAW 


301-320 


66 


E1-53 


ITGHRMAWDMMMNWSPTAAL 


313-332 


67 


E1-55 


NWSPTAALVMAQLLRIPQAI 


325-344 


63 


E1-57 


LLRIPQAILDMIAGAHVVGVL 


337-356 


69 


E1-59 


AGAHWGVLAGIAYFSMVGNM 


349-368 


70 


E1-63 


VVLLLFAGVDA5TIVSGGQA 


373-392 


7.1 
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E2 



E2-67 


SGLVSLFTP^ A Kniurm im— 


397-416 


72 


E2-69 




409-428 


73 


E2-$3B 




427-446 


74 


E2-$1B 




439-458 


75 


E2-1B 


GCPFPLA^PPP! TncnnriA/r 


451-470 


76 


E2-3B 


TDFD QGVVfiPI^Y A MrisPDnn 


463-482 


77 


E2-5B 


ANGSGPnORPYPWUVDCvar* 


475-494 


78 


E2-7B 


WHYPPKPCGIVPAK S\/Pf:p\/ 


48-7-506 


79 


E2-9B 


AKSVCGPVvpFTP^?\/\/\/r-r 


499-518 


80 


E2-1 IB 


PSPVVVPTTHP^n A DTvcm/r 


51 1-530 


81 


E2-13B 


GAPTY^ WPPMPiTPi\/C\ /! miw-t 
VJ ^ r I i J VV\JC.\U 1 U V TVLinN i 


523-542 


82 


E2-17B 


GNWrGfTWMMCTr :T/\/r^ a 
UlMWrLiU i VVlVll>Jo i ijr 1 NVwVjA 


547-566 


83 


E2-1 93 




559-575 


84 


E2-21 




571-590 


85 


E2-23 


i ut. nNrirDA t YortCGi^PW 


583-602 


86 


E2-25 


onLOiurvvl ; rrtuLVDYPYR 


595-614 


87 


E2-27 


P[ \/nVDVJ7f \A/UVQrTtM\/*t 
wLVUTr r nLWM YPC t INY i I 


607-626 


88 


E2-29 


t u\i t i irxinM Y VGG VEH 


619-638 


89 


E2-31 


MYVGGVEHRL5AACNW7PGE 


631-650 


90 


E2-33 


ACNWTPGERCDLEDRDRSEL 


643-662 


91 


E2-35 


EDRDRSELSPLLLTTTQWQV 


655-674 


92 
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CLAIMS 



1. Method for purifying recombinant HCV single or specific oiigomeric envelope 
proteins selected from the group consisting of £1 and/or E2 and/or E1/E2. 
characterized in that upon lysing the transformed host cells to isolate the 
recombinantly expressed protein a disulphide bond cieavage or reduction step is 
carried out with a disulphide bond cleavage agent. 

2. Method according to claim 1. wherein said disulphide cleavage or reduction stec is 
carried out under partial cleavage or reducing conditions. 

3. Method according to claim 1 or 2. wherein said disulphide bond cleavage agent is 
dithiothreitol (DTD. preferably in a concentration range of 0.1 to 50 mM, preferably 
0.1 to 20 mM, more preferably 0.5 to 10 mM. 

4. Method according to claim 1, wherein said disulphide bond cleavage agent is a 
detergent. 

5. Method according to claim 4, wherein said detergent is Smpigen-S3, preferably at 
a concentration of 1 to 10%, more preferably at a concentration of 3.5%. 

6. Method according to claim 1 or 2. wherein said disulphide bond cleaving agent 
comprises a combination of a classical disuiphide bond cleavage agent, such as DTT, 
and a detergent, such as Empigen-BB. 

7. Method according to any of claims 1 to 6, further comprising the step of blocking 
disulphide bond reformation with an SH group blocking agent. 

8. Method according to claim 7, wherein said SH group blocking agent is N- 
ethylmaleimide (NEM) or a derivative thereof. 

9. Method according to claim 7. wherein said step of blocking the disulphide bond 
reformation is brought about by low pH conditions. 



WO 96/04385 PCT/EP95/0303 1 

73 

10. Method according to any of claims 1 to 9, furxner characterized by at least the 
following steps: #• 

lysing recombinant El and:or E2 and.or E1/E2 expressing host cells, 
possibly in the presence of an SH blocking agent such as N- 
5 ethylmateimide (NEM), 

recovering said HCV envelope proteins by affinity purification such as 
by means of lectin-chromatography, such as tentil-iectin 
chromatography, or by means of immunoaffinity using anti-El and/or 
anti-E2 specific monoclonal antibodies. 

10 - reduction or cleavage of the disulfide bonds with a disuiphide bond 

cleaving agent, such as DT7, preferably also in the presence of an SH 
blocking agent, such as NEM or Bictin-NEM. and. 
recovering the reduced El and/or E2 and/or El 'E2 envelope proteins by 
gelfiitration and possibly aiso by a subsequent Ni-IMAC 

15 chromatography and desalting step. 

11. Composition comprising essentially purified recombinant HCV single or specific 
. oligomeric recombinant envelope proteins selected from the group consisting of E1 

and/or E2 and/or E1/E2, characterized as being isolated by a method according to 
any of claims 1 to 10. 

20 12. Composition according to claim 1 1, further characterized in that said 

recombinant HCV envelope proteins are expressed from recombinant mammaiian 
cells such as vaccinia. 

13. Composition according to claim 1 1, further characterized in that said 
recombinant HCV envelope proteins are expressed from recombinant yeast cells. 

25 14. Composition according to claim 1 1, further characterized in that said 

recombinant HCV envelope proteins are the expression product of at least one of the 
recombinant vectors according to any of claims 15 to 24. 



30 



15. Recombinant vector comprising a vector sequence, an appropriate prokaryotic. 
eukaryotic or viral promoter sequence followed by a nucleotide sequence allowing the 
expression of a single or specific oligomeric El and/or E2 and/or E1/E2 protein. 
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16. Recombinant vector according to claim 15, with said nucleotide sequence being 
characterized further in that it encodes a smgie HCV El protein starting in the region 
between amino acid positions 1 and 192 and ending in the region between amino 
acid positions 250 and 4Q0 f more particularly ending in the region between positions 

5 250 and 341, even more preferably ending in the region between position 290 and 
341. 

17. Recombinant vector according to claim 15, with said nucleotide sequence being 
characterized further in that it encodes a single HCV E1 protein starting in the region 
between amino acid positions 117 and 132 and ending in the region between amino 

10 acid positions 263 and 400, more particularly ending in the region between positions 
250 and 325. 

13. Recombinant vector according to any of claims 16 or 17, with said nucleotide 
sequence being characterized further in that it encodes a single HCV El protein 
bearing a deletion of the first hydrophobic domain between positions 264 to 293, 
15 plus of minus 8 amino acids. 

19. Recombinant vector according to claim 15, with said nucleotide sequence being 
characterized further in that it encodes a single HCV E2 protein starting in the region 
between amino acid positions 290 and 405 and ending in the region between amino 
acid positions 600 and 320, more particularly starting in the region between 

20 positions 322 and 406, even more preferably starting in the region between- position 
347 and 406 and most preferably starting in the region between positions 364 and 
406. 

20. Recombinant vector according to claim 19, with said nucleotide sequence being 
characterized further in that it ends at any of amino acid positions 623, 650, 661, 

25 673, 71 0, 71 5, 720, 745 or 809. 

21. Recombinant vector according to any of claims 16 to 20, with said nucleotide 
sequence being characterized further in that a 5'-terminal ATG codon and a 3'- 
termina! stop codon have been added to it. 

30 22. Recombinant vector according to any of claims 16 to 21, with said nucleotide 
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sequence being characterized further in that a factor Xa cleavage site and:or 3 to 10. 
preferably 6, histidine codons havs been added 3'-terrninaiiy to the coding region. 

23. Nucleic acid comprising any of the sequences as represented in SEQ ID NO 3, 5, 
7. 9, 11, 13, 21, 23, 25, 27, 29, 31, 35, 37, 39. 41, 43, 45, 47 and 49, or parts 

5 thereof. 

24. Recombinant vector carrying a recombinant nucleic acid according to ciaim 23. 

25. Recombinant vector according to any of claims 15 to 24, further characterized in 
that at least one of the giycosyistion sites present in said El or E2 protein has been 
removed at the nucleic acid level. 



10 26. A host cell transformed with at least one recombinant vector according to any of 
claims 15 to 26, wherein said vector comprises a nucleotide sequence encoding HCV 
E1 and/or E2 and/or E1/E2 protein as defined in any of claims 1 5 to 23 in addition to 
a regulatory sequence operable in said host celi and capable of regulating expression 
of said HCV E1 and/or E2 and/or E1/E2 protein. 

15 27. A recombinant E1 and/or E2 and/or E1/E2 protein expressed by a host cell 
according to claim 26. 

28. Method according to any of claims 1 to 10, further characterized as comprising 
at least the following steps: 

growing a host ceil as defined in claim 26 transformed with a recombinant 
20 vector according to any of claims 15 to 25 in a suitable culture medium, 

causing expression of said vector sequence as defined in any of claims 16 to 
25 under suitable conditions, and, 
* - lysing said transformed host cells, preferably in the presence of an SH group 
blocking agent, such as N-ethyimaieimide (NEM), 
25 - recovering said HCV envelope protein by affinity purification by means of for 
instance lectin-chromatography or immunoaffinity chromatography using anti- 
El and/or anti-E2 specific monoclonal antibodies, with said lectin being 
preferably lentil-lectin, followed by, 

incubation of the eluate of the previous step with a disulphide bond cleavage 
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agent, such as 07 1, preferably aiso in the presence of an SK group blocking 
agent, such as NSM or Biotin-NEM. and. 

isolating the HCV single or specific oligcmenc El and/or E2 and/or E1/E2 
. proteins by means of gelfiitration and possibly also by means of an additional 
Nr~-IMAC chromatography and desalting step. 

29. A composition comprising at least one of the following El and/or E2 peptides: 
E1-31 (SEQ ID NO 56) spanning amino acids 181 to 200 of the Core 'El V1 
region. 

E1-33 (SEQ ID NO 57) spanning ammo acids 1S3 to 212 of the E1 region. 
E1-35 (SEQ ID NO 53) spanning amino acids 205 to 224. of the El V2 region 
(epitope B), 

El -35 A (SEQ ID NO 59) scanning ammo acids 20S to 227 of the E1 V2 
region (epitope B), 

1bEl (SEQ ID NO 53) spanning amino acids 132 to 228 of E1 regions (V1, 
CI, and V2 regions (containing epitope B). 

E1-51 (SEQ ID NO 86) spanning ammo acids 301 to 320 of the E1 region, 
E1-53 (SEQ ID NO 67) spanning amino acids 313 to 332 of the E1 C4 region 
(epitope A), 

. El-55 (SEQ ID NO 68) spanning amino acids 325 to 344 of the E1 region. 
Env 67 or E2-57 (SEQ ID NO 721 spanning amino acid positions 397 to 416 of 
the E2 region (epitope A), 

Env 69 or E2-69 (SEQ ID NO 73) spanning amino acid positions 409 to 428 of 
the E2 region (epitope A). 

Env 23 or E2-23 (SEQ ID NO 86) spanning positions 583 to 602 of the E2 
region (epitope EL 

Env 25 or E2-25 (SEQ ID NO 87) spanning positions 595 to 614 of the E2 
region (epitope E), 

Env 27 or E2-27 (SEQ ID NO 88) spanning positions 607 to 626 of the E2 
region (epitope E), 

Env 17B or E2-173 (SEQ ID NO 83) spanning positions 547 to 566 of the E2 
region (epitope D), 

Env 1 3B or E2-1 3B (SEQ ID NO 82) spanning positions 523 to 542 of the E2 
region (epitope C). 
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30. A composition comprising at least one of the following E2 conformational 
epitopes: *• 

epitope F recognized by monoclonal antibodies 15C3C1, 12D11F1, and 
8G70D1H9, 

epitope G recognized by monoclonal antibody 9G3E5, 

epitope H {or C) recognized by monocionai antibodies 10D3C4 and 4H682, 

epitope I recognized by monoclonal antibody 17F2C2. 

31. An E1 and/or E2 specific monocionai antibody raised upon immunization with a 
composition according to any of claims 1 1 to 14 or 29 to 30. 

32. An E1 and/or E2 specific monoclonal antibody according to claim 31 for use as a 
medicament, more particularly for incorporation into an immunoassay kit for 
detecting the presence of HCV El or E2 antigen, for prognosis/monitoring of disease 
or for HCV therapy. 

33. Use of an E1 and/or E2 specific monoclonal antibody according to claim 31 for 
the preparation of an immunoassay kit for detecting HCV El or E2 antigens, for the 
preparation of a kit for prognosing/monitoring of HCV disease or for the preparation 
of a HCV medicament. 

34. Method for in vitro diagnosis of HCV antigen present in a biological sample, 
comprising at least the following steps: 

{l} contacting said biological sample with an E1 and/or E2 

specific monocionai antibody according to claim 31, 
preferably in an immobilized form under appropriate 
conditions which allow the formation of an immune 
complex, 

removing unbound components, 
incubating the immune complexes formed with 
heterologous antibodies, with said heterologous antibodies 
being conjugated to a detectable label under appropriate 
conditions. 

detecting the presence of said immune complexes visually 
or mechanically. 



(ii) 
(iii) 



(iv) 
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35. Kit for determining the presence of HCV antigens present m a biological sample, 
comprising: *■ 

at least one E1 and/or E2 specific monoclonal antibody according to 
claim 31, preferably in an immobilized form on a solid substrate, 
a buffer or components necessary for producing the buffer enabling 
binding reaction between these antibdcdies and the HCV antigens 
present in said biological sample, 

a means for detecting the immune complexes formed in the preceding 
binding reaction. 

• 36. A composition according to any of claims 11 ro 14 or 29 to 30 for use as a 
medicament. 

37. A composition according to any of claims 11 to 14 or 29 to 30 for use as a 
vaccine for immunizing a mammal, preferably humans, against HCV, comprising 
administrating an effective amount of said composition possibly accompanied by 
pharmaceutical^ acceptable adjuvants, to produce an immune response. 

38. Use of a composition according to any of claims 1 1 to 14 or 29 to 30, for the 
preparation of a vaccine for immunizing a mammal, preferably humans, against HCV, 
comprising administrating an effective amount of said composition possibly 
accompanied by pharmaceutical^ acceptable adjuvants, to produce an immune 
response. 

39. Vaccine composition for immunzing a mammal, preferably humans, against HCV. 
comprising an effective amount of a composition according to any of claims 1 1 to 14 
or 29 to 30 possibly accompanied by pharmaceutical^ acceptable adjuvants. 

40. A composition according to any of claims 1 1 to 14 or 29 to 30, for in vitro 
detection of HCV antibodies present in a biological sample. 

41. Use of a composition according to claims 11 to 14 or 29 to 30, for the 
preparation of an immunoassay kit for detecting HCV antibodies present in a 
biological sample. 
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42. Method for in vitro diagnosis of HCV antibodies present in, a biological sample, 
comprising at least the foilowirfg steps: 

(i) contacting said biological sample with a composition 
according to any of claims 11 to 14 or 29 to 30, 

5 preferably in an immobilized form under appropriate 

conditions which allow the formation of an immune 
complex, 

(ii) removing unbound components, 

(in) incubating the immune complexes formed with 

10 heterologous antibodies, with said heterologous antibodies 

being conjugated to a detectable label under appropriate 
conditions, 

(iv) detecting the presence of said immune complexes visually 

or mechanically. 

15 43. Kit for determining the presence of HCV antibodies present in a biological 
sample, comprising: 

at least one peptide or protein composition according to any of claims 
11 to 14 or 29 to 30, preferably in an immobilized form on a solid 
substrate, 

20 a buffer or components necessary for producing the buffer enabling 

binding reaction between these proteins or peptides and the antibodies 
against HCV present in said biological sample, 

a means for detecting the immune complexes formed in the preceding 
binding reaction. 

25 44. Use of composition comprising El proteins according to any of claims 1 1 to 14. 
or parts thereof according to claim 29, more particularly HCV single El proteins or 
E1 peptides, for in vitro monitoring HCV disease or prognosing the response to 
treatment, particularly with interferon, of patients suffering from HCV infection 
comprising: 

30 - incubating a biological sample from a patient with HCV infection with 

an El protein or a suitable part thereof under conditions allowing the 
formation of an immunological complex, 
removing unbound components, 
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calculating the anti-El titers present in said sample at the start of and 
during the coursi" of treatment, 

monitoring the natural course of HCV disease, or prcgnosing the 
response to treatment of said patient on the basis of the amount anti- 
El titers found in said sample at the start of treatment and/or during 
the course of treatment. 



45. Kit for monitoring HCV disease or prognosmg the response to treatment, 
particuiariy with interferon, of patients suffering from KCV infection comprising: 

at least one E1 protein or El peptide, more particularly an El protein or 
El peptide according to any of claims 1 1 to 14 or 29, 
a buffer or components necessary for producing the buffer enabling the 
binding reaction between these proteins or peptides and the anti-El 
antibodies present in a biological sample, 

means for detecting the immune complexes formed in the preceding 
binding reaction, 

possibly also an automated scanning and interpretation device for 
inferring a decrease of anti-E1 titers during the progression of 
treatment. 



46. A serotyping assay for detecting one or more serological types of HCV present in 
a biological sample, more particuiariy for detecting antibodies of the different types 
of HCV to be detected combined in one assay format comprising at least the 
following steps : 

(i) contacting the biological sample to be analyzed for the presence of 
HCV antibodies of one or more serological* types, with at least one of 
the El and/or E2 and/or E1/E2 protein compositions according to any 
of claims 1 1 to 14 or at least one of the El or E2 peptide compositions 
according to claim 29, preferentially in an immobilized form under 
appropriate conditions which allow the formation of an immune 
complex, 

(ii) removing unbound components, 

(iii) incubating the immune complexes formed with heterologous 
antibodies, with said heterologous antibodies being conjugated to a 
detectable label under appropriate conditions. 
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(iv) detecting the presence of said immune complexes visually or 

mechanically (e.d. by means of densitometry, fluorimerry. colcrimetry; 
and inferring the presence of one or more HCV serological Types 
present from the observed binding pattern. 

47. Kit for serotyping one or more serological types of HCV present in a biological 
sample, more particularly for detecting the antibodies" to these serological types of 
HCV comprising: 

at least one El and/or 52 and/or E1/E2 protein acccording to any of 
claims 1 1 to 1* or El or E2 peptide according to claim 29, 
a buffer or components necessary for producing the buffer enabling the 
binding reaction between these proteins or peptides and the anti-El 
antibodies present in a biological sample, 

means for detecting the immune complexes formed in the preceding 
binding reaction, 

15 - possibly also an automated scanning and interpretation device for 

detecting the presence of one or more serological types present from 
the observed binding pattern. 

43. A peptide or protein composition according to any of claims 1 1 to 14 or 29, 
for immobilization on a solid substrate and incorporation into a reversed phase 
20 hybridization assay, preferably for immobilization as parallel lines onto a solid support 
such as a membrane strip, for determining the presence or the genotype of HCV 
according to a method of any of claims 42 or 46. 
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Figure 21 



5' GGCA7GCAAGC77AA77AA773' (SEQ ID NO 1) 
3'ACGTCCGTACGTTCGAATTAATTAATCGA5' (SEQ ID NO 94) 



S'CCGGGGAGGCCTGCACGTGATCGAGGGCAGACACCATCACCACCATCACTAATAGT 
TAATTAACTGCA 3' (SEQ ID NO 2) 

3'CCTCCGGACGTGCACTAGCTCCCGTCTGTGGTAGTGGTGGTAGTGATTATCAATTAATTG 
5' (SEQ !D NO 25) 



SEQ !D NO 3 (r,CC!9A) 

ATGCCCGG77GC7CTT7C7CTATC7TCCTCTTGGCTTTACTG7CC7GTCTGACCATTCCA 
GC77CCGC77A7GAGG7GCGCAACG7G7CCGGGA7G7ACCA7G7CACGAACGAC7GC7 
CCAAC7CAAGCA77G7G7A7GAGGCAGCGGACA7GA7CA7GCACACCCCCGGG7GCG7 
GCCC7GCG77CGGGAGAACAAC7C77CCCGC7GC7GGG7AGCGC7CACCCCCACGC7C 
GCAGC7AGGAACGCCAGCG7CCCCACCACGACAA7ACGACGCCACG7CGA777GC7CG 
7TGGGGCGGC7GC7C7C7G7TCCGC7A7G7ACG7GGGGGA7C7C7GCGGA7C7G7C77C 
C7CG7C7CCCAGC7G77CACCA7C7CGCC7CGCCGGCA7GAGACGG7GCAGGAC7GCA 
A77GC7CAA7C7A7CCCGGCCACA7AACAGG7CACCG7A7GGC77GGGA7A7GA7GA7 
GAAC7GG7CGCC7ACAACGGCCC7GG7GG7A7CGCAGC7GC7CCGGA7CCCACAAGC7 
G7CG7GGACATGG7GGCGGGGGCCCA7TGGGGAGTCC7GGCGGGCCTCGCC7AC7AT7 
CCA7GG7GGGGAAC7GGGC7AAGG7777GA77G7GA7GC7AC7C777GC7C7C7AA7AG 

SEQ ID NO 5 (HCC110A) 

A7G77GGG7AAGG7CA7CGA7ACCC77ACA7GCGGC77CGCCGACC7CG7GGGG7ACA 
77CCGC7CG7CGGCGCCCCCC7AGGGGGCGC7GCCAGGGCCC7GGCGCA7GGCG7CCG 
GG77C7GGAGGACGGCG7GAAC7A7GCAACAGGGAA7T7GCCCGG77GC7C777C7C7 
A7C77CC7C77GGC777GC7G7CC7G7C7GACCG77CCAGC7TCCGC77A7GAAG7GCG 
CAACG7G7CCGGGA7G7ACCA7G7CACGAACGAC7GC7CCAAC7CAAGCA77G7G7A7 
GAGGCAGCGGACA7GA7CA7GCACACCCCCGGG7GCG7GCCC7GCG77CGGGAGAAC 
AAC7C77CCCGC7GC7GGG7AGCGC7CACCCCCACGC7CGCAGC7AGGAACGCCAGCG 
7CCCCACCACGACAA7ACGACGCCACG7CGA777GC7CG77GGGGCGGC7GC777C7G 
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TTCCGCTATGTACGTGGGGGACCTCTGCGGATCTGTCT7CCTCGTCTCCCAGC7GTTCA 
CCA7CTCGCCTCGCCGGCATGAGACGGTGCAGGAC7GCAA77GCTCAATCTA7CCCGG 
CCACA7AACGGG7CACCG7A7GGC77GGGA7A7GA7GA7GAAC7GG7CGCC7ACAACG 
GCCC7GG7GG7A7CGCAGC7GC7CCGGA7CCCACAAGC7G7CG7GGACA7GG7GGCGG 
GGGCCCA77GGGGAG7CC7GGCGGG7C7CGCC7AC7A77CCA7GG7GGGGAAC7GGGC 
7AAGG7777GA77G7GA7GC7AC7C777GC7CCC7AA7AG 

SEQ ID NO 7 (HCCI1 1AJ 

A7G77GGG7AAGG7CA7CGA7ACCC77ACG7GCGGC77CGCCGACC7CA7GGGG7ACA 
77CCGC7CG7CGGCGCCCCCC7AGGGGG7GC7GCCAGAGCCC7GGCGCA7GGCG7CCG 
GG77C7GGAAGACGGCG7GAAC7A7GCAACAGGGAA777GCC7GG77GC7C777C7C7A 
7C77CC7C77GGC777AC7G7CC7G7C7GACCA77CCAGC77CCGC77A7GAGG7GCGC 
AACG7G7CCGGGA7G7ACCA7G7CACGAACGAC7GC7CCAAC7CAAGCA77G7G7A7G 
AGGCAGCGGACA7GA7CA7GCACACCCCCGGG7GCG7GCCC7GCG77CGGGAGAACA 
AC7C77CCCGC7GC7GGG7AGCGC7CACCCCCACGC7CGCAGC7AGGAACGCCAGCG7 
CCCCAC7ACGACAA7ACGACGCCACG7CGA777GC7CG77GGGGCGGC7GC777C7G77 
CCGC7A7G7ACG7GGGGGA7C7C7GCGGA7C7G7C77CC7CG7C7CCCAGC7G77CACC 
A7C7CGCC7CGCCGGCA7GAGACGG7GCAGGAC7GCAA77GC7CAA7C7A7CCCGGCC 
ACA7AACAGG7CACCG7A7GGC77GGGA7A7GA7GA7GAAC7GG7AA7AG 

SEQ ID NO 9 (HCCI12A) 

A7GCCCGG77GC7C777C7C7A7C77CC7CT7GGCCC7GC7G7CC7G7C7GACCA7ACCA 
GC77CCGCT7A7GAAG7GCGCAACG7G7CCGGGG7G7ACCA7G7CACGAACGAC7GC7 
CCAAC7CAAGCA7AG7G7A7GAGGCAGCGGACA7GA7CA7GCACACCCCCGGG7GCG7 
GCCC7GCG77CGGGAGGGCAAC7CC7CCCG77GC7GGG7GGCGC7CAC7CCCACGC7C 
GCGGCCAGGAACGCCAGCG7CCCCACAACGACAA7ACGACGCCACG7CGA777GC7C 
G77GGGGC7GC7GC777C7G77CCGC7A7G7ACG7GGGGGA7C7C7GCGGA7C7G7777 
CC77G777CCCAGC7G77CACC77C7CACC7CGCCGGCA7CAAACAG7ACAGGAC7GCA 
AC7GC7CAA7C7A7CCCGGCCA7G7A7CAGG7CACCGCATGGC7TGGGA7A7GA7GA7 
GAAC7GG7CC7AA7AG 



SEQ ID NO 11 (HCCI13A) 

A7G7CCGG77GC7C777C7C7A7C77CC7C77GGCCC7GC7G7CC7G7C7GACCA7ACCA 
GC77CCGC77A7GAAG7GCGCAACG7G7CCGGGG7G7ACCA7G7CACGAACGAC7GC7 
CCAAC7CAAGCATAG7G7A7GAGGCAGCGGACATGATCATGCACACCCCCGGG7GCG7 
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GCCC7GCGT7CGGGAGGGCAACTCCTCCCG7TGC7GGG7GGCGCTCACTCCCACGCTC 
GCGGCCAGGAACGCCAGCG7CCCCACAACGACAA7ACGACGCCACG7CGA777GC7C 
G77GGGGC7GC7GC777C7G77CCGC7A7G7ACG7GGGGGA7C7C7GCGGA7C7G7777 
CC77G777CCCAGC7G77CACC77C7CACC7CGCCGGCA7CAAACAG7ACAGGAC7GCA 

AC7GC7CAA7C7A7CCCGGCCA7G7A7CAGG7CACCGCA7GGC77GGGA7A7GA7GA7 
GAAC7GG7AA7AG 

SEQ ID NO 13 (HCCI17A) 

A7GC7GGG7AAGGCCA7CGA7ACCC77ACG7GCGGC77CGCCGACC7CG7GGGG7ACA 

T7CCGC7CG7CGGCGCCCCCC7AGGGGGCGC7GCCAGGGCCC7GGCGCA7GGCG7CCG 

GG7TCTGGAAGACGGCGTGAAC7A7GCAACAGGGAA777GCC7GG77GC7C777C7C7A 

TC77CC7C77GGC777AC7G7CC7G7C7AACCA77CCAGC77CCGC77ACGAGG7GCGC 

AACG7G7CCGGGA7G7ACCA7G7CACGAACGAC7GC7CCAAC7CAAGCA77G7G7A7G 

AGGCAGCGGACA7GA7CA7GCACACCCCCGGG7GCG7GCCC7GCG77CGGGAGAACA 

AC7C77CCCGC7GC7GGG7AGCGC7CACCCCCACGC7CGCGGC7AGGAACGCCAGCA7 

CCCCAC7ACAACAA7ACGACGCCACG7CGA777GC7CG77GGGGCGGC7GC777C7G77 

CCGC7A7G7ACG7GGGGGA7C7C7GC3GA7C7G7C77CC7CG7C7CCCAGC7G77CACC 

A7C7CGCC7CGCCGGCA7GAGACGG7GCAGGAC7GCAA77GC7CAA7C7A7CCCGGCC 

ACA7AACGGG7CACCG7A7GGC77GGGA7A7GA7GA7GAAC7GG7AC7AA7AG 



SEQ ID NO 15 (HCPr5 i) 
A7GCCCGG77GC7C777C7C7A7C77 

SEQ ID NO 16 (HCPr52) 
A7G77GGG7AAGG7CA7CGA7ACCC7 

SEQ ID NO 17 (HCPr53) 
C7A77AGGACCAG77CA7CA7CA7A7CCCA 

SEQ ID NO 18 (HCPr54) 
C7A77ACCAG77CA7CA7CA7A7CCCA 



SEQ ID NO 19 (HCPrl07) 

A7ACGACGCCACG7CGA7TCCCAGC7G77CACCA7C 
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SEQ ID NO 20 (HC?rl03) 

GATGGTGAACAGCTGGGAATCGACGTGGCGTCGTAT 
SEQ ID NO 21 (HCCI37) 

A7GTTGGGTAAGGTCATCGATACCCTTACATGCGGC7TCGCCGACCTCGTGGGGTACA 
T7CCGCTCGTCGGCGCCCCCCTAGGGGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCG 
GGTTCTGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTCTCT 
ATCT7CCTCT7GGCTTTGCTGTCCTGTCTGACCG77CCAGCTTCCGCTTATGAAGTGCG 
CAACGTGTCCGGGATGTACCATGTCACGAACGACTGCTCCAACTCAAGCATTGTGTA7 
GAGGCAGCGGACATGATCATGCACACCCCCGGGTGCGTGCCCTGCGT7CGGGAGAAC 
AACTCTTCCCGCTGCTGGGTAGCGCTCACCCCCACGCTCGCAGCTAGGAACGCCAGCG 
TCCCCACCACGACAA7ACGACGCCACG7CGAT7CCCAGCTGTTCACCATCTCGCC7CG 
CCGGCA7GAGACGG7GCAGGACTGCAA77GC7CAA7C7A7CCCGGCCACA7AACGGG7 
CACCG7A7GGC77GGGA7A7GATGA7GAACTGG7CGCC7ACAACGGCCC7GG7GG7A7 
CGCAGC7GC7CCGGA7CCCACAAGC7G7CG7GGACA7GG7GGCGGGGGCCCA77GGGG 
AG7CC7GGCGGG7C7CGCC7AC7A77CCA7GG7GGGGAAC7GGGC7AAGG7777GA77G 
7GA7GC7AC7C777GC7CCC7AA7AG 

SEQ ID NO 23 (HCCI3S) 

A7G77GGG7AAGG7CA7CGA7ACCC77ACA7GCGGC77CGCCGACC7CG7GGGG7ACA 

77ccgc7cg7cggcgcccccc7agggggcgc7gccagggccc7ggcgca7ggcg7ccg 
gg7tc7ggaggacggcg7gaactatgcaacagggaa77tgcccggt7gctctttc7ct 
a7c77cc7c77ggc777gc7g7cc7g7c7gaccg77ccagc77ccgc77a7gaag7gcg 
caacgtgtccgggatgtaccatgtcacgaacgactgctccaactca'agcattgtgtat 
gaggcagcggaca7ga7ca7gcacacccccggg7gcg7gccc7gcg77cgggagaac 

AAC7C77CCCGC7GC7GGG7AGCGC7CACCCCCACGC7CGCAGC7AGGAACGCCAGCG 

7CCCCACCACGACAA7ACGACGCCACG7CGA77CCCAGC7G77CACCA7C7CGCC7CG 

CCGGCA7GAGACGG7GCAGGAC7GCAA77GC7CAA7C7A7CCCGGCCACA7AACGGG7 

CACCG7A7GGC77GGGA7A7GA7GA7GAAC7GG7AA 

7AG 

SEQ ID NO 25 (HCCI39) 

ATGTTGGG7AAGGTCATCGATACCC7TACATGCGGCT7CGCCGACCTCGTGGGGTACA 
77CCGC7CG7CGGCGCCCCCC7AGGGGGCGC7GCCAGGGCCC7GGCGCA7GGCG7CCG 
GG77C7GGAGGACGGCG7GAAC7A7GCAACAGGGAA777GCCCGG77GC7C777C7C7 



WO 96/04385 



25/53 



PCT/EP95/0303I 



A: ~CCTCTTGGCTT7GCTG7CCTGTCTGACCG7TCCAGCT7CCGCTTATGAAG7GCG 
CAACG7G7CCGGGA7G7ACCA7G7CACGAACGAC7GC7CCAAC7CAAGCA77G7G7A7 
GAGGCAGCGGACA7GA7CA7GCACACCCCCGGG7GCG7GCCC7GCG77CGGGAGAAC 
AAC7C77CCCGC7GC7GGG7AGCGC7CACCCCCACGC7CGCAGC7AGGAACGCCAGCG 
7CCCCACCACGACAA7ACGACGCCACG7CGA77CCCAGC7G77CACCA7C7CGCC7CG 
CCGGCA7GAGACGG7GCAGGAC7GCAA77GC7CAA7C7A7CCCGGCCACA7AACGGG7 
CACCG7A7GGC77GGGA7A7GA7GA7GAAC7GG7CGCC7ACAACGGCCC7GG7GG7A7 
CGCAGC7GC7CCGGA7CC7C7AA7AG 

S5Q ID NO 27 (HCCI40) 

" A7G7TGGG7AAGG7CATCGA7ACCC77ACA7GCGGC77CGCCGACC7CG7GGGG7ACA 
"n"CCGC7CG7CGGCGCCCCCC7AGGGGGCGC7GCCAGGGCCC7GGCGCA7GGCG7CCG 
GG77C7GGAGGACGGCG7GAAC7A7GCAACAGGGAA777GCCCGG77GC7C777C7C7 
A7C77CC7C77GGC777GC7G7CC7G7C7GACCG77CCAGC77CCGC77A7GAAG7GCG 
CAACG7G7CCGGGA7G7ACCA7G7CACGAACGAC7GC7CCAAC7CAAGCA77G7G7A7 
GAGGCAGCGGACA7GA7CA7GCACACCCCCGGG7GCG7GCCC7GCG77CGGGAGAAC 
AAC7C77CCCGC7GC7GGG7AGCGC7CACCCCCACGC7CGCAGC7AGGAACGCCAGCG 
7CCCCACCACGACAA7ACGACGCCACG7CGA77CCCAGC7G77CACCA7C7CGCC7CG 
CCGGCA7GAGACGG7GCAGGAC7GCAA77GC7CAA7C7A7CCCGGCCACA7AACGGG7 
CACCG7A7GGC77GGGA7A7GA7GA7GAAC7GG7CGCC7ACAACGGCCC7GG7GG7A7 
CGCAGC7GC7CCGGA7CG7GA7CGAGC-GCAGACACCA7CACCACCA7CAC7AA7AG 

S£Q ID NO 29 (HCCI62) 

A7GGG7AAGG7CA7CGA7ACCC77ACG7GCGGA77CGCCGA7C7CA7GGGG7ACA7CC 
CGC7CG7CGGCGC7CCCG7AGGAGGCG7CGCAAGAGCCC77GCGCA7GGCG7GAGGGC 
CC77GAAGACGGGA7AAA777CGCAACAGGGAA777GCCCGG77GC7CC7777C7A777 
7CC77C7CGC7C7G77C7C77GC77AA77CA7CCAGCAGC7AG7C7AGAG7GGCGGAA7 
ACG7C7GGCC7C7A7G7CC77ACCAACGAC7G77CCAA7AGCAG7A77G7G7ACGAGGC 
CGA7GACG77A77C7GCACACACCCGGC7GCA7ACC77G7G7CCAGGACGGCAA7ACA 
TCCACG7GC7GGACCCCAG7GACACC7ACAG7GGCAG7CAAG7ACG7CGGAGCAACCA 

CCGCT7CGA7ACGCAG7CA7G7GGACC7A77AG7GGGCGCGGCCACGA7G7GC7C7GC 
GC7C7ACG7GGG7GACA7G7G7GGGGC7G7C77CC7CG7GGGACAAGCC77CACG77CA 
GACC7CG7CGCCA7CAAACGG7CCAGACC7G7AAC7GC7CGC7G7ACCCAGGCCA7C7 
TTCAGGACA7CGAA7GGCT7GGGA7A7GA7GA7GAAC7GG7AA7AG 
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SEQ ID NO 3. (HCCI63) 

ATGGG7AAGGTCATCGATACCCTAACGTGCGGAT7CGCCGATCTCATGGGGTATATCC 

i 

CGCTCGTAGGCGGCCCCATTGGGGGCGTCGCAAGGGCTCTCGCACACGGTGTGAGGGT 

CCTTGAGGACGGGGTAAACTATGCAACAGGGAATTTACCCGGTTGCTCTTTCTCTATCT 

TTA77CTTGCTCTTCTCTCGTGTCTGACCGTTCCGGCCTCTGCAGTTCCCTACCGAAATG 

CCTCTGGGATTTA7CATGTTACCAATGATTGCCCAAACTCTTCCATAGTCTATGAGGCA 

GA7AACC7GA7CC7ACACGCACC7GGT7GCG7GCC77G7G7CA7GACAGG7AA7GTGA 

G7AGA7GC7GGG7CCAAA77ACCCC7ACAC7G7CAGCCCCGAGCC7CGGAGCAG7CAC 

GGC7CC7C77CGGAGAGCCG77GAC7ACC7AGCGGGAGGGGC7GCCC7C7GC7CCGCG 

77A7ACG7AGGAGACGCG7G7GGGGCAC7A77C77GG7AGGCCAAA7G77CACC7A7A 

GGCC7CGCCAGCACGCTACGG7GCAGAAC7GCAAC7G77CCA777ACAG7GGCCA7G7 

7ACCGGCCACCGGA7GGCA7GGGA7A7GA7GA7GAAC7GG7AA7AG 

SEQ ID NO 33 (HCPr109) 
7GGGA7A7GA7GA7GAAC7GG7C 

SEQ ID NO 34 (HCPr72) 

C7A77A7GG7GG7AAKGCCARCARGAGCAGGAG 
SEQ ID NO 35 (HCCL22A) 

7GGGA7A7GA7GA7GAAC7GG7CGCC7ACAACGGCCC7GG7GG7A7CGCAGC7GC7CC 
GGA7CCCACAAGC7G7CG7GGACA7GG7GGCGGGGGCCCA77GGGGAG7CC7GGCGG 
GCC7CGCC7AC7A77CCA7GG7GGGGAAC7GGGC7AAGG7777GG77G7GA7GC7AC7C 
T7TGCCGGCG7CGACGGGCATACCCGCGTGTCAGGAGGGGCAGCAGCCTCCGATACCA 
GGGGCC77G7G7CCC7C777AGCCCCGGG7CGGC7CAGAAAA7CCAGC7CG7AAACAC 
CAACGGCAG77GGCACA7CAACAGGAC7GCCC7GAAC7GCAACGAC7CCC7CCAAAC 
AGGG77C777GCCGCAC7A77C7ACAAACACAAA77CAAC7CG7C7GGA7GCCCAGAG 
CGC7TGGCCAGC7G7CGC7CCA7CGACAAG77CGC7CAGGGG7GGGG7CCCC7CAC77 
ACAC7GAGCC7AACAGC7CGGACCAGAGGCCC7AC7GC7GGCAC7ACGCGCC7CGACC 
G7G7GG7A77G7ACCCGCG7C7CAGG7G7GCGG7CCAG7G7A77GC77CACCCCGAGCC 
C7G77G7GG7GGGGACGACCGA7CGG777GG7G7CCCCACG7A7AAC7GGGGGGCGAA 
CGAC7CGGATG7GCTGATTCTCAACAACACGCGGCCGCCGCGAGGCAAC7GG7TCGGC 
TG7ACA7GGA7GAA7GGCACTGGG7TCACCAAGACG7G7GGGGGCCCCCCG7GCAACA 
7CGGGGGGGCCGGCAACAACACC77GACC7GCCCCAC7GAC7G7777CGGAAGCACCC 
CGAGGCCACC7ACGCCAGA7GCGG77C7GGGCCC7GGC7GACACC7AGG7G7A7GG77 
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CA ; i ACCCATA i AGGC i CTGGCAC i ACCCC i GCACTGTCAACTTCACCA7CTTCAAGG7 

TAGGA7G7ACG7GGGGGGCG7GGAGCACAGG7TCGAAGCCGCA7GCAA77GGAC7CG 

AGGAGAGCG77G7GAC77GGAGGACAGGGA7AGA7CAGAGC77AGCCCGC7GC7GCTG 

TC7ACAACAGAG7GGCAGA7ACTGCCC7GTTCC7TCACCACCCTGCCGGCCC7A7CCA 

CCGGCC7GA7CCACC7CCA7CAGAACA7CG7GGACG7GCAA7ACC7G7ACGG7G7AGG 

G7CGGCGG77G7C7CCC77G7CA7CAAA7GGGAG7A7G7CC7G77GC7C77CC77C7CC7 

GGCAGACGCGCGCA7C7GCGGC7GC77A7GGA7GA7GC7GC7GA7AGC7CAAGC7GAG 

GCCGCCT7AGAGAACC7GG7GG7CC7CAA7GCGGCGGCCG7GGCCGGGGCGCA7GGC 

AC7C77TCC77CC77G7G7TC77C7G7GC7GCC7GG7ACA7CAAGGGCAGGC7GG7CCC 

TGG7GCGGCA7ACGCC77C7A7GGCG7G7GGCCGC7GC7CC7GC77C7GC7GGCC77AC 
CACCACGAGC77A7GCC7AG7AA 



5=G ID NO 37 (KCCI^l) 

GA7CCCACAAGC7G7CG7GGACA7GG7GGCGGGGGCCCA77GGGGAG7CC7GGCGGG 

CC7CGCC7AC7A77CCA7GG7GGGGAAC7GGGC7AAGG7777GG77G7GA7GC7AC7C7 

T7GCCGGCG7CGACGGGCA7ACCCGCG7G7CAGGAGGGGCAGCAGCC7CCGA7ACCA 

GGGGCC77G7G7CCC7C777AGCCCCGGG7CGGC7CAGAAAA7CCAGC7CG7AAACAC 

CAACGGCAG77GGCACA7CAACAGGAC7GCCC7GAAC7GCAACGAC7CCC7CCAAAC 

AGGG77C777GCCGCAC7A77C7ACAAACACAAA77CAAC7CG7C7GGA7GCCCAGAG 

CGC77GGCCAGC7G7CGC7CCA7CGACAAG77CGC7CAGGGG7GGGG7CCCC7CAC7T 

ACAC7GAGCC7AACAGC7CGGACCAGAGGCCC7AC7GC7GGCAC7ACGCGCC7CGACC 

GTG7GG7A77G7ACCCGCG7C7CAGG7G7GCGG7CCAG7G7A77GC77CACCCCGAGCC 

C7G77G7GG7GGGGACGACCGA7CGG777GG7G7CCCCACG7A7AAC7GGGGGGGGAA 

CGACTCGGA7G7GC7GA7TC7CAACAACACGCGGCCGCCGCGAGGCAAC7GG7TCGGC 

7G7ACA7GGA7GAA7GGCAC7GGG77CACCAAGACG7G7GGGGGCCCCCCG7GCAACA 

7CGGGGGGGCCGGCAACAACACC77GACC7GCCCCAC7GAC7G7777CGGAAGCACCC 

CGAGGCCACCTACGCCAGA7GCGG77C7GGGCCC7GGC7GACACC7AGG7G7A7GG7T 

CA77ACCCA7A7AGGC7C7GGCAC7ACCCC7GCAC7G7CAAC77CACCA7C77CAAGG7 

7AGGA7G7ACG7GGGGGGCG7GGAGCACAGG77CGAAGCCGCA7GCAA77GGAC7CG 

AGGAGAGCG77G7GAC77GGAGGACAGGGA7AGA7CAGAGC77AGCCCGC7GC7GC7G 

TC7ACAACAGAG7GGCAGAG7GGCAGAGC77AA77AA77AG 

SEQ ID NO 39 (HCCI42) 

GA7CCCACAAGC7G7CG7GGACA7GG7GGCGGGGGCCCA77GGGGAG7CC7GGCGGG 
CC7CGCC7AC7A77CCA7GG7GGGGAAC7GGGC7AAGG7777GG77G7GA7GC7AC7C7 
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T7GCCGGCG7CGACGGGCA7ACCCGCG7G7CAGGAGGGGCAGCAGCC7CCGA7ACCA 

GGGGCC77G7G7CCC7C777AGCCCCGGG7CGGC7CAGAAAA7CCAGC7CG7AAACAC 

CAACGGCAG77GGCACA7CAACAGGAC7GCCC7GAAC7GCAACGAC7CCC7CCAAAC 

AGGG77C777GCCGCAC7A77C7ACAAACACAAA77CAAC7CG7C7GGA7GCCCAGAG 

CGC77GGCCAGC7G7CGC7CCA7CGACAAG77CGC7CAGGGG7GGGG7CCCC7CAC77 

ACAC7GAGCC7AACAGC7CGGACCAGAGGCCC7AC7GC7GGCAC7ACGCGCC7CGACC 

G7G7GG7A77G7ACCCGCG7C7CAGG7G7GCGG7CCAG7G7A77GC77CACCCCGAGCC 

C7G77G7GG7GGGGACGACCGA7CGG777GG7G7CCCCACG7A7AAC7GGGGGGCGAA 

CGAC7CGGA7G7GC7GA77C7CAACAACACGCGGCCGCCGCGAGGCAAC7GG77CGGC 

7G7ACA7GGA7GAA7GGCAC7GGG77CACCAAGACG7G7GGGGGCCCCCCGTGCAACA 

7CGGGGGGGCCGGCAACAACACC77GACC7GCCCCAC7GAC7G7777CGGAAGCACCC 

CGAGGCCACC7ACGCCAGA7GCGG77C7GGGCCC7GGC7GACACC7AGG7G7A7GG77 

CA77ACCCA7A7AGGC7C7GGCAC7ACCCC7GCAC7G7CAAC77CACCA7C77CAAGG7 

7AGGA7G7ACG7GGGGGGCG7GGAGCACAGG77CGAAGCCGCA7GCAA77GGAC7CG 

AGGAGAGCG77G7GAC77GGAGGACAGGGA7AGA7CAGAGC77AGCCCGC7GC7GC7G 

7C7ACAACAGG7GA7CGAGGGCAGACACCA7CACCACCA7CAC7AA7AG 

S5Q ID NO 41 (HCC143) 

A7GG7GGGGAAC7GGGC7AAGG7777GG77G7GA7GC7AC7C777GCCGGCG7CGACG 
GGCA7ACCCGCG7G7CAGGAGGGGCAGCAGCC7CCGA7ACCAGGGGCC77G7G7CCC7 
C777AGCCCCGGG7CGGC7CAGAAAA7CCAGC7CG7AAACACCAACGGCAG77GGCAC 
A7CAACAGGAC7GCCC7GAAC7GCAACGAC7CCC7CCAAACAGGG77C777GCCGCAC 
7A77C7ACAAACACAAA77CAAC7CG7C7GGA7GCCCAGAGCGC77GGCCAGC7G7CG 
C7CCA7CGACAAG77CGC7CAGGGG7GGGG7CCCC7CAC77ACAC7GAGCC7AACAGC 
7CGGACCAGAGGCCC7AC7GC7GGCAC7ACGCGCC7CGACCG7G7GG7A77G7ACCCG 
CG7C7CAGG7G7GCGG7CCAG7G7A77GC77CACCCCGAGCCC7G77G7GG7GGGGAC 
GACCGA7CGG777GG7G7CCCCACG7A7AAC7GGGGGGCGAACGAC7CGGA7G7GC7G 
ATTC7CAACAACACGCGGCCGCCGCGAGGCAAC7GG77CGGC7G7ACA7GGA7GAA7G 
GCAC7GGG77CACCAAGACG7G7GGGGGCCCCCCG7GCAACA7CGGGGGGGCCGGCA 
ACAACACC77GACC7GCCCCAC7GAC7G7777CGGAAGCACCCCGAGGCCACC7ACGC 
CAGA7GCGG77C7GGGCCC7GGC7GACACC7AGG7G7A7GG77CA77ACCCA7A7AGG 
C7C7GGCAC7ACCCC7GCAC7G7CAAC77CACCA7C77CAAGG77AGGA7G7ACG7GGG 
GGGCG7GGAGCACAGG77CGAAGCCGCA7GCAA77GGAC7CGAGGAGAGCG77G7GA 

C77GGAGGACAGGGA7AGA7CAGAGC77AGCCCGC7GC7GC7G7C7ACAACAGAG7GG 
CAGAGC77AA77AA77AG 
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SEQ ID NO 43 (HCC!4^) 

ATGGTGGGGAACTGGGCTAAGGTTT7GGTTGTGA7GCTACTC7TTGCCGGCGTCGACG 
GGCATACCCGCGTGTCAGGAGGGGCAGCAGCCTCCGATACCAGGGGCCT7GTGTCCCT 
C777AGCCCCGGG7CGGC7CAGAAAA7CCAGC7CG7AAACACCAACGGCAG77GGCAC 
A7CAACAGGAC7GCCC7GAAC7GCAACGAC7CCC7CCAAACAGGG77C777GCCGCAC 
7A77C7ACAAACACAAA77CAAC7CG7C7GGA7GCCCAGAGCGC77GGCCAGC7G7CG 
C7CCA7CGACAAG77CGC7GAGGGG7GGGG7CCCC7CAC77ACAC7GAGCC7AACAGC 
7CGGACCAGAGGCCC7AC7GC7GGCAC7ACGCGCC7CGACCG7G7GG7A77G7ACCCG 
CG7C7CAGG7G7GCGG7CCAG7G7A77GC77CACCCCGAGCCC7G77G7GG7GGGGAC 
GACCGA7CGG777GG7G7CCCCACG7A7AAC7GGGGGGCGAACGAC7CGGA7G7GC7G 
A77CTCAACAACACGCGGCCGCCGCGAGGCAACTGGTTCGGCTGTACATGGA7GAATG 
GCAC7GGG77CACC A AG ACG7G7GGGGGCCCCCCG7GC A AC A7CGGGGGGGCCGGC A 
ACAACACC77GACC7GCCCCACTGACTGTTT7CGGAAGCACCCCGAGGCCACC7ACGC 
CAGA7GCGGT7C7GGGCCC7GGCTGACACCTAGG7G7A7GG77CA77ACCCA7A7AGG 
C7C7GGCAC7ACCCC7GCAC7G7CAAC77CACCA7C77CAAGG77AGGA7G7ACG7GGG 
GGGCG7GGAGCACAGG77CGAAGCCGCA7GCAA77GGAC7CGAGGAGAGCG77G7GA 
C77GGAGGACAGGGA7AGA7CAGAGC77AGCCCGC7GC7GC7G7C7ACAACAGG7GA7 
CGAGGGCAGACACCATCACCACCA7CACTAA7AG 



SEQ ID NO 45 (HCCL64) 

A7GG7GGCGGGGGCCCA77GGGGAG7CC7GGCGGGCC7CGCC7AC7A77CCA7GG7GG 
GGAAC7GGGC7AAGG7777GG77G7GA7GC7AC7C777GCCGGCG7CGACGGGCA7AC 
CCGCG7G7CAGGAGGGGCAGCAGCC7CCGA7ACCAGGGGCC77G7G7CCC7C777AGC 
CCCGGGTCGGCTCAGAAAATCCAGC7CGTAAACACCAACGGCAGT7GGCACATCAAC 
AGGAC7GCCC7GAAC7GCAACGAC7CCC7CCAAACAGGG77C777GCCGCAC7A77C7 
ACAAACACAAA77CAAC7CG7C7GGA7GCCCAGAGCGC77GGCCAGC7G7CGC7CCA7 
CGACAAG77CGC7CAGGGG7GGGG7CCCC7CAC77ACAC7GAGCC7AACAGC7CGGAC 
CAGAGGCCC7AC7GC7GGCAC7ACGCGCC7CGACCG7G7GG7A77G7ACCCGCG7C7C 
AGG7G7GCGG7CCAG7G7A77GC77CACCCCGAGCCC7G77G7GG7GGGGACGACCGA 
7CGG777GG7G7CCCCACG7A7AAC7GGGGGGCGAACGAC7CGGA7G7GC7GA77C7C 
AACAACACGCGGCCGCCGCGAGGCAAC7GG7TCGGC7G7ACATGGATGAATGGCACT 
GGG77CACCAAGACG7G7GGGGGCCCCCCG7GCAACA7CGGGGGGGCCGGCAACAAC 
ACC77GACC7GCCCCAC7GAC7G7777CGGAAGCACCCCGAGGCCACC7ACGCCAGA7 
GCGG77C7GGGCCC7GGC7GACACC7AGG7G7A7GG77CA77ACCCA7A7AGGC7C7GG 
CACTACCCCTGCACTGTCAACTTCACCA7CTTCAAGGTTAGGA7GTACGTGGGGGGCG 
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TGGAGCACAGG77CGAAGCCGCATGCAATTGGACTCGAGGAGAGCGTTGTGACT7GGA 
GGACAGGGATAGATCAGAGC77AGCCCGCTGCTGCTGTC7ACAACAGAGTGGCAGATA 
CTGCCCTGTTCCTTCACCACCCTGCCGGCCCTA7CCACCGGCC7GA7CCACCTCCATCA 
GAACA7CG7GGACG7GCAA7ACC7G7ACGG7G7AGGG7CGGCGG77G7C7CCC77G7C 
A7CAAA7GGGAG7A7G7CC7G77GC7C77CC77C7CC7GGCAGACGCGCGCA7C7GCGC 
C7GC77A7GGA7GA7GC7GC7GA7AGC7CAAGC7GAGGCCGCC77AGAGAACC7GG7G 
G7CC7CAA7GCGGCGGCCG7GGCCGGGGCGCA7GGCAC7C777CC77CC77G7G77C77 
C7G7GC7GCC7GG7ACA7CAAGGGCAGGC7GG7CCC7GG7GCGGCA7ACGCC77C7A7 
GGCG7G7GGCCGC7GC7CC7GC77C7GC7GGCC77ACCACCACGAGC77A7GCC7AG7AA 

SEQ ID NO 47 (HCCI65) 

AA777GGG7AAGG7CA7CGA7ACCC77ACA7GCGGC77CGCCGACC7CG7GGGG7ACA 
77CCGC7CG7CGGCGCCCCCC7AGGGGGCGC7GCCAGGGCCC7GGCGCA7GGCG7CCG 
GG77C7GGAGGACGGCG7GAAC7A7GCAACAGGGAA777GCCCGG77GC7C777C7C7 
A7C77CC7C77GGC7TTGC7G7CC7G7C7GACCG77CCAGC77CCGC77A7GAAG7GCG 
CAACG7G7CCGGGA7G7ACCA7G7CACGAACGAC7GC7CCAAC7CAAGCA77G7G7A7 
GAGGCAGCGGACA7GA7CA7GCACACCCCCGGG7GCG7GCCC7GCG77CGGGAGAAC 
AAC7C77CCCGC7GC7GGG7AGCGC7CACCCCCACGC7CGCAGC7AGC-AACGCCAGCG 
7CCCCACCACGACAA7ACGACGCCACG7CGA777GC7CG77GGGGCGGC7GC777C7G 
77CCGC7A7G7ACG7GGGGGACC7C7GCGGA7C7G7C77CC7CG7C7CCCAGC7G77CA 
CCA7C7CGCC7CGCCGGCA7GAGACGG7GCAGGAC7GCAA77GC7CAA7C7A7CCCGG 
CCACA7AACGGG7CACCG7A7GGC77GGGA7A7GA7GA7GAAC7GG7CGCC7ACAACG 
GCCC7GG7GG7A7CGCAGC7GC7CCGGA7CCCACAAGC7G7CG7GGACA7GG7G'GCGG 
GGGCCCA77GGGGAG7CC7GGCGGGCC7CGCC7AC7A77CCA7GG7GGGGAAC7GGGC 
7AAGG7777GG77G7GA7GC7AC7C777GCCGGCG7CGACGGGCA7ACCCGCG7G7CAG 
GAGGGGCAGCAGCC7CCGA7ACCAGGGGCC77G7G7CCC7C777AGCCCCGGG7CGGC 
TCAGAAAA7CCAGCTCG7AAACACCAA.CGGCAG77GGCACA7CAACAGGAC7GCCC7 
GAAC7GCAACGAC7CCC7CCAAACAGGG77C777GCCGCAC7A77C7ACAAACACAAA 
77CAAC7CG7C7GGA7GCCCAGAGCGC77GGCCAGC7G7CGC7CCA7CGACAAG77CG 
C7CAGGGG7GGGG7CCCC7CAC77ACAC7GAGCC7AACAGC7CGGACCAGAGGCCC7A 
C7GC7GGCAC7ACGCGCC7CGACCG7G7GG7A77G7ACCCGCG7C7CAGG7G7GCGG7 
CCAG7G7A77GC77CACCCCGAGCCC7G77G7GG7GGGGACGACCGA7CGG777GG7G7 
CCCCACG7A7AAC7GGGGGGCGAACGAC7CGGA7G7GC7GA7TC7CAACAACACGCGG 
CCGCCGCGAGGCAAC7GG77CGGC7G7ACA7GGA7GAA7GGCAC7GGG77CACCAAGA 
CG7G7GGGGGCCCCCCG7GCAACA7CGGGGGGGCCGGCAACAACACC77GACC7GCC 
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CCACTGACTGTT7TCGGAAGCACCCCGAGGCCACCTACGCCAGATGCGGTTCTGGGCC 

CTGGC7GACACC7AGGTG7ATGGTICA77ACCCATATAGGCTCTGGCACTACCCCTGCA 

CTGTCAACT7CACCATCTTCAAGG7TAGGATG7ACGTGGGGGGCG7GGAGCACAGGT7 

CGAAGCCGCA7GCAA77GGAC7CGAGGAGAGCG77G7GAC77GGAGGACAGGGA7AG 

A7CAGAGCT7AGCCCGC7GC7GC7G7C7ACAACAGAG7GGCAGA7AC7GCCC7G77CC 

T7CACCACCC7GCCGGCCC7A7CCACCGGCC7GA7CCACC7CCA7CAGAACA7CG7GG 

ACG7GCAA7ACC7G7ACGG7G7AGGG7CGGCGG77G7C7CCC77G7CA7CAAA7GGGA 

G7A7G7CC7G77GC7C77CC77C7CC7GGCAGACGCGCGCA7C7GCGCC7GC77A7GGA 

TGA7GC7GC7GA7AGC7CAAGC7GAGGCCGCC77AGAGAACC7GG7GG7CC7CAA7GC 

GGCGGCCG7GGCCGGGGCGCA7GGCAC7C777CC77CC77G7G77C77C7G7GC7GCC7 

GG7ACA7CAAGGGCAGGC7GG7CCC7GG7GCGGCA7ACGCC77C7A7GGCG7G7GGCC 

GC7GC7CC7GC77C7GC7GGCC77ACCACCACGAGC77A7GCC7AG7AAGC77 

SE2 ID NO 49 (HCCI65) 

A7G AG CACG A A7CC7A A ACC7C A A AG AAA A ACC A A ACG7AACACC A ACCGCCGCCC A 
CAGGACG7CAAG77CCCGGGCGG7GG7CAGA7CG77GG7GGAG777ACC7G77GCCGC 
GCAGGGGCCCCAGG77GGG7G7GCGCGCGAC7AGGAAGAC77CCGAGCGG7CGCAAC 
C7CG7GGGAGGCGACAACC7A7CCCCAAGGC7CGCCGACCCGAGGG7AGGGCC7GGG 
C7CAGCCCGGG7ACCC77GGCCCC7C7A7GGCAA7GAGGGCA7GGGG7GGGCAGGA7G 
GC7CC7G7CACCCCGCGGC7C7CGGCC7AG77GGGGCCC7ACAGACCCCCGGCG7AGG 
TCGCG7AA777GGG7AAGG7CA7CGA7ACCC77ACA7GCGGC77CGCCGACC7CG7GG 
GG7ACA77CCGC7CG7CGGCGCCCCCC7AGGGGGCGC7GCCAGGGCCC7GGCGCA7GG 
CG7CCGGG77C7GGAGGACGGCG7GAAC7A7GCAACAGGGAA777GCCCGG77GC7C7 
77C7C7A7C77CC7C77GGCTT7GC7G7CC7G7C7GACCG77CCAGC77CCGC77A7GAA 
G7GCGCAACG7G7CCGGGA7G7ACCA7G7CACGAACGAC7GC7CCAAC7CAAGCA77G 
7G7A7GAGGCAGCGGACA7GA7CA7GCACACCCCCGGG7GCG7GCCC7GCG77CGGGA 
GAACAAC7C77CCCGC7GC7GGG7AGCGC7CACCCCCACGC7CGCAGC7AGGAACGCC 
AGCG7CCCCACCACGACAA7ACGACGCCACG7CGA777GC7CG77GGGGCGGC7GC77 
7C7G77CCGC7A7G7ACG7GGGGGACC7C7GCGGA7C7G7C77CC7CG7C7CCCAGC7G 
77CACCA7C7CGCC7CGCCGGCA7GAGACGG7GCAGGAC7GCAA77GC7CAA7C7A7C 
CCGGCCACA7AACGGG7CACCG7A7GGC77GGGA7A7GA7GA7GAAC7GG7CGCC7AC 
AACGGCCC7GG7GG7A7CGCAGC7GC7CCGGA7CCCACAAGC7G7CG7GGACA7GG7G 
GCGGGGGCCCA77GGGGAG7CC7GGCGGGCC7CGCC7AC7A77CCA7GG7GGGGAAC7 
GGGC7AAGG7777GG77G7GA7GC7AC7C777GCCGGCG7CGACGGGCA7ACCCGCG7 
G7CAGGAGGGGCAGCAGCC7CCGA7ACCAGGGGCC77G7G7CCC7C777AGCCCCGGG 
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TCGGCTCAGAAAATCCAGCTCGTAAACACCAACGGCAG7TGGCACATCAACAGGACT 
GCCCTGAACTGCAACGACTCCC7CCAAACAGGG77CT7TGCCGCACTA7TCTACAAAC 
ACAAATTCAACTCGTCTGGATGCCCAGAGCGCTTGGCCAGCTGTCGCTCCATCGACAA 
GT7CGCTCAGGGG7GGGGTCCCC7CACTTACACTGAGCCTAACAGCTCGGACCAGAGG 
CCC7AC7GC7GGCAC7ACGCGCC7CGACCG7G7GG7A77G7ACCCGCG7C7CAGG7G7 
GCGG7CCAG7G7A77GC77CACCCCGAGCCC7G77G7GG7GGGGACGACCGA7CGG77 
TGG7G7CCCCACG7A7AAC7GGGGGGCGAACGAC7CGGA7G7GC7GA77C7CAACAAC 
ACGCGGCCGCCGCGAGGCAAC7GG77CGGC7G7ACA7GGA7GAA7GGCAC7GGG77CA 
CCAAGACG7G7GGGGGCCCCCCG7GCAACA7CGGGGGGGCCGGCAACAACACC77GA 
CC7GCCCCAC7GAC7G7777CGGAAGCACCCCGAGGCCACC7ACGCCAGA7GCGG77C 
7GGGCCC7GGC7GACACC7AGG7G7A7GG77CA77ACCCA7A7AGGC7C7GGCAC7AC 
CCC7GCAC7G7CAAC77CACCA7C77CAAGG77AGGA7G7ACG7GGGGGGCG7GGAGC 
ACAGG77CGAAGCCGCA7GCAA77GGAC7CGAGGAGAGCG77G7GAC77GGAGGACA 
GGGA7AGA7CAGAGC77AGCCCGC7GC7GC7G7C7ACAACAGAG7GGCAGA7AC7GCC 
C7G77CC77CACCACCC7GCCGGCCC7A7CCACCGGCC7GA7CCACC7CCA7CAGAAC 
A7CG7GGACG7GCAA7ACC7G7ACGG7G7AGGG7CGGCGG77G7C7CCC77G7CA7CA 
AA7GGGAG7A7G7CC7G77GC7C77CC77C7CC7GGCAGACGCGCGCA7C7GCGCC7GC 
77A7GGA7GA7GC7GC7GA7AGC7CAAGC7GAGGCCGCC77AGAGAACC7GG7GG7CC 
7CAA7GCGGCGGCCG7GGCCGGGGCGCA7GGCAC7C777CC77CC77G7G77C77C7G7 
GC7GCC7GG7ACA7CAAGGGCAGGC7GG7CCC7GG7GCGGCA7ACGCC77C7A7GGCG 
7G7GGCCGC7GC7CC7GC77C7GC7GGCC77ACCACCACGAGC77A7GCC7AG7AA 



WO 96/04385 




PCT/EP95/03031 



33/59 
Figure 22 



OD measured at 450 imi 
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OD measured at 450 nm 
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Figure 23: Western Blot Analysis with anti-El mouse monoclonal 5E1A10 
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FIGURE 30 
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Figure 38 

Relative Map Positions of 
anti-E2 monoclonal antibodies 
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Figure 39 
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Figure 40 
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SIGNATURE ^^V^iT , . DATE _ 



FTCJSBJ 10 (11-50) Paienr and Trademark OfEec; UX DEPARTMENT OF COMMERCE 



Docket Number (Optimal) 



Applicant or Patentee Geert M ABRTSire F flns aP e M . K - 
Serial or Paten: No: _PCT/EPgg /n»n^ — 




A* a below named in venter I herahv Wp-i„~ +. * 

sU the application ideadSed above. 
C die patent icendfied above. 

concen: under 37 G* ^ or a nonprofit aSSS^S^^ 1 ^ * ***** 

-^-^^^^^ 

U No^hpers^c^cer^ororsani^nex^ 

^Fml ^ CQQCSni OT 3r 2*^°n is list- below 
FULL NAME : BINOCEiiBrccS If V " 
ADDRESS : ^Bi,^^ 

S^LL BHSI^S-^lcEr 1 ' ^ 
Separate verified oaieacas are rtsniired fram . 

^ **** » *« stn, a, W^Sc?^^^ or orsaxnadon having rights to the ^ 

S-Se^^^^^^ S^SS ^ ttd *" * — - on inform 

Untad SaMa code, and that seen ^d^S^L^S^ «*«*edon iMi of T :tk -Sof *T 
«mni ***** or any pat** to wS^S^SI^ ^ **** ^appScanon. any 



Geert H AERTENS 



Fans 



Guy - eg MAHTYNOPF 





NAME OF INVENTOR 

SxfSJttSK of xzxvcQsar 



NAME OF INVENTOR 



RUL£S3(37 0FiLl-S3) 
DECLARATION AND POWER OF ATTORNEY 
FOR PATENT APPLICATION 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

As a beiow named inventor, i hereby declare mat my residence, post af5=e address and citizenship are as sUtcd below next to my name, and J 
bcSeve I am the arfgaiaJ first and scie inventor (If onfy one name listed oeiew) or an orfcinai, fist and joint inventor (ff plural names are 5st»d 
below) ofme subject matter which s ciairned and fee which a patent is sought an me invention emflJed: 

PURIFIED HEPATITIS C VIHUS SKVcLOPS PflOTEINS.FOR DIAGNOSTIC AftB THSHAPSUTIC USE 
ihe specification af which (check apptaibte box(s}): 
is attached hereto 

was fled en as U.S. AppScaOon Serial No. " • 



was fled as PCT imernaticnal appficetfon Na. PCT/£?95/03Q3- on July 31, 1995 

awTd (if apptabie to U.S. or PCT application) was amended on 



I ftentay state ttmt ! hava reviewed and undens&nd the contents of the above identified specification, including the ctotms, as amended by any 
amendment referred to above. I acknowledge me duty to disctose rntbrmsSen which « material to the patentabifiry of mis app£cat»n in accordance 
wflft 37 Cf .R. ! hereby daim foreign priority benefits under 35 U.S.C. 1 12/365 cf any foreign appficatSon(s) for patent or inventors certificate 
feted beiow and nave ate identified below any foreign application far patent or inventors sertiScate having a ffinc; cats before that of the application 
on which pnortfy is etaimed or. if no priority is claimed, before Hie $in£ data of -lis apptatiorr 

Prtor Foreign Appficaifcn{s); 

Application Ntsnber Country Qay/Moftmrfear Ftied 

9^870132.1 EU3CPS 29 July 199^ 

f hereby daim the benefit under 35 U.S.C 12G/2S5 of ail prtar Unfed States and PCT Intematfanai applications Gsted above or heiowano, insofar as 
me subject matter of each of me cSeims of mis appfcaticn is no: aaciased in such prtor applications in the manner provided by me first paragraph of 
35 UJS.C. 112, 1 acknowledge me dirty to disclose materia* inforrnaten as defined in 37 Cr.R. 1-56 which occurred between the flRng date of me 
jjprior applications and me rrattonaf or PCT Intematfenaf ffing date of mis appScaSoru 

'^ppSaOBnS^A^ Day/Mcnth/Year riled pending, abandoned 

:j pCT/£?95/03C31 31 July 1995 .pending 

:; 4 nereby deciare mat aU statements made herein of my dvm taowieds* are mje and mat an statements made on information and beiief are heGeved 
: ? fe be true; and former that mese statements were made wim me knowledge mat wiffitii iaise statements and the like so made are punishable by tSne 
" or imprisonment; or bom, under Section 1001 ofT2ie 18 of me United States Code and mat swen wiWUl fefcse statements may jeopardize me vanity 
Sf me appGcaticn or any patent issued thereon- And f hereby appoint NKOM A VAKOERHYE 1100 Norm GJebe RcU Mi Root, ArSngtcn, 
VA 2220i-*7l4, teJephcna number (703) aiS-40flfi (to whom afl exsnvnunicafions are to be directed), and the fbitowinfl attorneys thereof (of me 
r »me address} individuaiiy and ccfectivefy my attorneys to pmsecute ma application and to transact all business in me Patent and TrademarX Office 
'""connected therewfm and with me resufing patent Arthur R. Crawford, 35327; Larry S. Nbesn. 2S64C; Hahert X Vanderhye, 2707S; James T 
■ •riosmer, 301A4; Robert W. Fans, 31352, Richanl G. Seshe, 2277C; S. Nusbaum, 32343; Michael J. Keenan, 32106; Bryan H. Davidson, 30251; 
ffittnley C. Spooner. 273S3; Leonard C. Mitcrard, 2S0Q9: Duane M Byers, 33353; Paul J. Henon, 33S2S; Jenry H. Neison, 30481; John ft Ustova, 
• ; api4S; H. Warren Sumam, Jr.. 2S3S6; Thomas 9yrn*» WaiV 1 'A&on, 32955; J. Scott Oavfdson. 334ft9. 

'% Inventors Signature: ^—^^y \ ^ ^ . Date: ^V^/ ^ ^ 

inventor Ceegf ^ 7 ^ ' ; MAERTSNS Belgian 

prstj MI Past) (catensh'p) 

Residence: (cay) B-8310 BRUGGE 3 (sate/country) Belgium 

Past Office Address: ' 2ilvers-parrengrraac 64 . B-8^10 BRUGGS 3 , Belgium . 

(Zip Code) 3-P^ij 




Inventof s Signature; >ffi/£gS^' b Oak- 

inventor ? 3^^t< BCSMAN ✓ ^^Belgian 

Ml Qstst) (csfeertship) 

Residence: (c2y) B- 1-7^5 0P.v1JK Cstaie/cstintry) Belgium ■ 

Post Office Address: Kulst 165, B-1745 OPWUK. Belgium " ^ 



(2p Code} 5-17115 
Inventors signature; 



inventor: Guy *~~V -^-^H^~ ^ £ MAHTYN0F? J Bfelgiaa 



(first) *^ MT (last) (citizenship) 

Residence: (city) B-141Q WAT5RLC0 {state/country) Belgium ■ - , 

post Office Address: Martotstraat 71, B-1410 WATERLOO, Belgium 

(T-p Code) B-miQ 



ADfimCMAJL fNVEafTORS, check box j and attach sheet with some mfberation and signature and date for each. 




k^t9tSotm Signature 

{%} B-9820 MERSLBEK3 



^ — °~ — &m _ 

_ (ttsQ Belgian 

PwtOflfce Asanas: » — STr^TT^r- — Z-"% — ""^ frtMttasunliy) Belgium 



for cacti. 



